This project implements a Gradient Boosting Classifier to predict car purchase likelihood using a Japanese dataset and applies the model to estimate potential customers in an Indian dataset. The Jupyter Notebook (Car_Purchase_Prediction.ipynb) includes data preprocessing, feature engineering, model training, evaluation, business insights, and a Tableau visualization plan.
- Japanese Dataset:
JPN Data.xlsx - CN_Mobiles.csv - Indian Dataset:
IN_Data.xlsx - IN_Mobiles.csv
pip install pandas sklearn seaborn matplotlib- Requires Python 3.8+ and Jupyter Notebook.
- Place dataset files in the project directory.
- Clone the repository:
git clone https://github.com/itsbk13/Car_Purchase_Prediction.git cd Car_Purchase_Prediction - Run
Car_Purchase_Prediction.ipynbin Jupyter to execute the full workflow. - Use exported
japan_data.csvandindia_data.csvfor Tableau visualizations.
Car_Purchase_Prediction/
├── JPN Data.xlsx - CN_Mobiles.csv # Japanese dataset
├── IN_Data.xlsx - IN_Mobiles.csv # Indian dataset
├── Car_Purchase_Prediction.ipynb # Main notebook
├── japan_data.csv # Exported Japanese data
├── india_data.csv # Exported Indian data
├── output.csv # Indian predictions
└── README.md # This file
- Model: Gradient Boosting Classifier (70.26% accuracy).
- Key Predictors: Car age (>360 days) and income.
- Business Insight: Target high-income individuals with older cars.
- Tableau: Visualize age, income, and purchase trends.
- This project is licensed under the MIT License.
- Attribution: This project was built as a capstone during an internship program with Internshala.