This repository contains a project for predicting crop yield using machine learning models (XGBoost, LSTM, Random Forest) based on climatic and economic variables.
-
📓 Notebooks – Colab
.ipynb
files for data analysis, model training, and evaluation:ApplesProjectEDA.ipynb
– Exploratory Data Analysis and preprocessing.ApplesProjectModels.ipynb
– Model training and evaluation.
-
📈 Results – Performance comparisons and visualizations.
-
📁 Data – Dataset files (not included in the repository, see details below).
- ✅ Descriptive statistics (mean, median, standard deviation).
- ✅ Data distribution tests (Histogram, Q-Q Plot, Shapiro-Wilk, Pearson).
- ✅ Models: XGBoost, LSTM, Random Forest.
- ✅ Evaluation using RMSE vs. STD.
Model | Test STD | RMSE | Zone / Approach |
---|---|---|---|
XGBoost | 2.497 | 2.244 | Individual zone – Yonatan |
XGBoost + Econ | 2.071 | 1.364 | Combined zones + economic data |
LSTM | 2.497 | 2.316 | Individual zone – Yonatan |
Random Forest | 2.532 | 2.271 | Combined zones + economic data |
- XGBoost with combined features outperformed other models.
- RMSE significantly lower than STD indicates high prediction accuracy.
- LSTM and Random Forest underperformed in this specific scenario.
git clone https://github.com/Dangutman98/Apple-Yield-Climate-Research.git
cd Apple-Yield-Climate-Research
- Open
ApplesProjectEDA.ipynb
and run it step-by-step for EDA & preprocessing. - Then run
ApplesProjectModels.ipynb
for model training and evaluation.
ℹ️ Note: The datasets are loaded directly from the authors' Google Drive during notebook execution. You do not need to upload any files manually. Simply run the notebook cells as-is.
- Run
ApplesProjectEDA.ipynb
(mandatory for data preparation). - Run
ApplesProjectModels.ipynb
(after EDA is completed).
- Dan Gutman
- Tal Krispin
- Shahar Ben Laiche
- Volcani Institute – Israel's national powerhouse of agricultural innovation.
- The dataset files are not included in this repository due to size constraints.
- The notebooks are pre-configured to fetch the required files from a private Google Drive. Ensure you have access via the shared links or modify paths accordingly.