Skip to content

wessware/Boosting_Algorithms

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

65 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Statement Problem

Allocating loans to customers with an assurance that they will payback is serious problem for many seasoned and upcoming fintech companies. This machine learning approach intends to predict [classify] customers as credit worth or not by assigning them a credit score between 0 and 1 and a payback probability. With a proper tuning, this model could be the awaited solution to help companies avoid losses in money lending.

Boosting_Algorithms

Algorithms that use boosting techniques to minimize the loss function for both classification and regression problems.

Attention will be paid to the training time and loss minimization for the models

Models

For this project, we will primarily focus on two boosting algorithm; CatBoost and Extreme Gradient Boosting algorithm.

Extreme Gradient Boosting Algorithm

Trained on 80% of the dataset.

Performance

xgb

Initial rmse scored 0.5179 [using binary:logistic loss function] Score after Hyperparameter tuning 0.4685 [using binary:logistic loss function] Score using hyperopt tuner rmse 0.4349 [using binary:logistic loss function]

score using synthetic data ===> 0.2872281323269014 [binary:logistic loss function] score using synthetic data ===> 0.2806243040080456 [multi:softmax loss function]

score; hyperopt + synthetic data ===> 0.2645751311064591 [binary:logistic loss function]

catboost

Initial rmse score ===> one hot, 0.4938647983247948, ordinal 0.4324255657551935 optuna rmse score ===> one hot, 0.45083481733371616, ordinal 0.4597631061983315

synthetic data rmse score ===> 0.2337848562121592 [base model]

synthetic data optuna rmse score ====> 0.2936835031117683 synthetic data grid search score ===> 0.27988092706244444

Best rmse scores

Extreme Gradient Boosting [XGB] Classifier ==> 0.2645751311064591 [trained faster than CatBoost classifier]

CatBoost [CB] Classifier ==> 0.27988092706244444

https://github.com/wessware/clustering_algorithms

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published