This is a repository for binary classifiers (LR and GB), used for prediction on a real-life dataset. The model building included data cleaning, feature selection and hyperparameter tuning.

sanjayc2/Prediction_Gradient_Boosting_Classsifier

Prediction_Gradient_Boosting_Classsifier

This project predicts the likelihood of a ticket being paid, using a real-life dataset. It was done for a Coursera project. The model building involved data exploration and cleaning (using pandas), feature engineering (selection and transformation), and model training and validation. Two binary classifiers were developed: a linear classifier (logistic regression with an L1 regularizer) and a non-linear classifier (gradient-boosted decision trees). The hyperparameters were tuned using the validation set. The model's ROC-AUC was close to 80%, which is close to the highest that has been obtained on this data set.
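
The workflow described above (two classifiers, hyperparameters chosen on a validation set, ROC-AUC as the metric) can be sketched roughly as follows. This is a minimal illustration, not the code in this repository: it assumes scikit-learn, uses synthetic data from `make_classification` in place of the ticket dataset, and the hyperparameter grids and the helper `validation_auc` are hypothetical choices made only for the example.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Assumption: synthetic stand-in data, since the real ticket dataset is not included here.
X, y = make_classification(
    n_samples=2000, n_features=20, n_informative=8, random_state=0
)

# Hold out a validation set that is used only for hyperparameter selection.
X_train, X_val, y_train, y_val = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)


def validation_auc(model):
    """Fit on the training split and return ROC-AUC on the validation split."""
    model.fit(X_train, y_train)
    scores = model.predict_proba(X_val)[:, 1]  # probability of the positive class
    return roc_auc_score(y_val, scores)


# Linear classifier: logistic regression with an L1 penalty.
# The liblinear solver supports L1; C is the inverse regularization strength.
lr_results = {}
for C in (0.01, 0.1, 1.0):  # hypothetical grid
    lr = make_pipeline(
        StandardScaler(),
        LogisticRegression(penalty="l1", solver="liblinear", C=C),
    )
    lr_results[C] = validation_auc(lr)

# Non-linear classifier: gradient-boosted decision trees.
gb_results = {}
for learning_rate in (0.05, 0.1):  # hypothetical grid
    for max_depth in (2, 3):
        gb = GradientBoostingClassifier(
            learning_rate=learning_rate,
            max_depth=max_depth,
            n_estimators=100,
            random_state=0,
        )
        gb_results[(learning_rate, max_depth)] = validation_auc(gb)

# Pick the setting with the highest validation ROC-AUC for each model family.
best_C = max(lr_results, key=lr_results.get)
best_gb = max(gb_results, key=gb_results.get)
print("Best LR C:", best_C, "validation AUC:", round(lr_results[best_C], 3))
print("Best GB (learning_rate, max_depth):", best_gb,
      "validation AUC:", round(gb_results[best_gb], 3))
```

Because the validation set drives the hyperparameter choice, the validation ROC-AUC of the selected model tends to be slightly optimistic; a separate held-out test set would give a cleaner estimate.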
