Skip to content

Avocado Price Prediction Project for Machine Learning course. In this repository, you can find the dataset and source code. Everything was done according to the requirements.

Notifications You must be signed in to change notification settings

SananMuzaffarov/Avocado-Price-Prediction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

2 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Avocado Price Prediction Project

This project focuses on analyzing and predicting avocado prices in the United States using historical sales data. We applied data science and machine learning techniques to identify trends, patterns, and future prices.

πŸ“Š Dataset

  • Source: Hass Avocado Board via Kaggle
  • File: Avocado.csv
  • Features Include:
    • Date: The date of the observation
    • AveragePrice: Average price per avocado
    • Total Volume: Total number of avocados sold
    • 4046, 4225, 4770: PLU (Product Lookup) codes representing different avocado types
    • Total Bags, Small Bags, Large Bags, XLarge Bags
    • type: Conventional or organic
    • year: Year of observation
    • region: Geographic market region in the U.S.

πŸ“ˆ Objectives

  • Explore and clean avocado price data.
  • Perform exploratory data analysis (EDA).
  • Build regression models to predict avocado prices.
  • Visualize results and insights.
  • Evaluate and compare model performance.

πŸ§ͺ Methods & Tools

  • Environment: Jupyter Notebook
  • Libraries Used:
    • Pandas & NumPy (data manipulation)
    • Matplotlib & Seaborn (visualization)
    • Scikit-learn (machine learning)
    • Statsmodels (statistical analysis)

πŸ” Key Highlights

  • Detailed EDA including region-wise trends and yearly price changes.
  • Applied multiple regression techniques including:
    • Linear Regression
    • Random Forest Regressor
    • Decision Tree
  • Included data preprocessing steps like encoding, feature scaling, and train-test split.
  • Visualized prediction results to evaluate model accuracy.

πŸ“ Files

  • AvocadoGroupProject.ipynb: Main notebook containing code, analysis, and model building.
  • Avocado.csv: Dataset used for training and evaluation.

πŸš€ How to Run

  1. Clone the repository:
    git clone https://github.com/yourusername/avocado-price-prediction.git
    cd avocado-price-prediction
    
  2. Install the required packages:
    pip install -r requirements.txt
    
  3. Launch the notebook:
    jupyter notebook AvocadoGroupProject.ipynb

About

Avocado Price Prediction Project for Machine Learning course. In this repository, you can find the dataset and source code. Everything was done according to the requirements.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published