This project predicts the likelihood that a ticket will be paid, using a real-life dataset. It was done for a Coursera project. Model building involved data exploration and cleaning (using pandas), feature engineering (selection, transformation), and model training and validation. Two binary classifiers were developed: a linear classifier (logistic regression with an L1 regularizer) and a non-linear classifier (gradient-boosted decision trees). The hyperparameters were tuned using the validation set. The model's ROC-AUC was close to 0.80, which is close to the highest that has been obtained on this dataset.
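The workflow described above (two classifiers, hyperparameters chosen on a validation set, ROC-AUC as the metric) can be sketched roughly as follows. This is only an illustration, not the repository's code: it assumes scikit-learn is available, uses synthetic data from `make_classification` in place of the ticket dataset, and the hyperparameter grids, the `validation_auc` helper, and the variable names are all hypothetical.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the cleaned ticket features (assumption: the real
# dataset and its engineered features are not reproduced here).
X, y = make_classification(
    n_samples=2000, n_features=20, n_informative=6, random_state=0
)
X_train, X_val, y_train, y_val = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)


def validation_auc(model):
    """Fit on the training split and return ROC-AUC on the validation split."""
    model.fit(X_train, y_train)
    return roc_auc_score(y_val, model.predict_proba(X_val)[:, 1])


# Linear classifier: logistic regression with an L1 penalty.
# C is the inverse regularization strength; the grid is illustrative.
lr_scores = {
    C: validation_auc(
        make_pipeline(
            StandardScaler(),
            LogisticRegression(penalty="l1", solver="liblinear", C=C),
        )
    )
    for C in (0.01, 0.1, 1.0)
}

# Non-linear classifier: gradient-boosted decision trees.
# The learning-rate grid is illustrative.
gb_scores = {
    rate: validation_auc(
        GradientBoostingClassifier(
            n_estimators=100, learning_rate=rate, max_depth=3, random_state=0
        )
    )
    for rate in (0.05, 0.1)
}

# Pick the setting with the highest validation ROC-AUC for each model.
best_C = max(lr_scores, key=lr_scores.get)
best_rate = max(gb_scores, key=gb_scores.get)
print(f"LR  best C={best_C}: AUC={lr_scores[best_C]:.3f}")
print(f"GB  best learning_rate={best_rate}: AUC={gb_scores[best_rate]:.3f}")
```

The L1 penalty drives some coefficients to exactly zero, so the logistic regression also acts as a form of feature selection. A held-out test split, not shown here, would normally be used for the final score, since the validation score is optimistic after tuning.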
This is a repository for binary classifiers (LR and GB), used for prediction on a real-life dataset. The model building included data cleaning, feature selection and hyperparameter tuning.
sanjayc2/Prediction_Gradient_Boosting_Classsifier