Internship Project at SaiKet Systems
π Objective:
This project analyzes customer churn patterns, builds predictive models, and provides actionable insights to improve customer retention.
β
Performed Data Cleaning & Processing
β
Exploratory Data Analysis (EDA) to Identify Churn Patterns
β
Built Machine Learning Models for Churn Prediction
β
Evaluated Model Performance Using Accuracy, Precision, Recall, and F1-Score
β
Provided Business Recommendations to Reduce Churn
- Google Colab β‘
- Python π
- Pandas, NumPy, Matplotlib & Seaborn π
- Scikit-Learn π€
- Power BI π (For Data Visualization)
Model | Accuracy | Precision | Recall | F1-Score |
---|---|---|---|---|
Logistic Regression | 0.369 | 0.278 | 0.869 | 0.422 |
Decision Tree | 0.656 | 0.429 | 0.912 | 0.584 |
Random Forest | 0.735 | 0.000 | 0.000 | 0.000 |
- Logistic Regression has high recall (0.869), meaning it identifies churned customers well, but low precision.
- Decision Tree performs better overall, with higher accuracy (0.656) and balanced recall (0.912).
- Random Forest achieves the highest accuracy (0.735) but fails in precision and recall (0.0), possibly due to class imbalance.
πΉ Fix Class Imbalance: Use SMOTE (Synthetic Minority Over-sampling Technique) or re-sampling techniques.
πΉ Tune Hyperparameters: Use GridSearchCV for Decision Tree & Random Forest.
πΉ Try Other Models: Experiment with XGBoost, SVM, or Neural Networks for better performance.
π Main Notebook: PROJECT_UNDER_SAIKET_SYSTEM.ipynb