Project Overview

Note: This is a prototype and not the actual code used in production.

This MVP fraud detection model serves as a benchmark for evaluating the performance of various machine learning algorithms, including XGBoost, CatBoost, Random Forest, Neural Network, and Logistic Regression. The project comprises three Jupyter notebooks dedicated to data analysis, model development, and performance evaluation. The structure and purpose of each notebook are outlined below:

Notebook Summaries

1. a0_Utility_Functions.ipynb

This notebook contains frequently used functions organized as utility macros. These functions can be reused across different projects to streamline tasks such as data preprocessing, visualization, and performance evaluation.

2. a1_Exploratory_Analysis.ipynb

This notebook focuses on exploratory data analysis (EDA). It includes steps for loading data, cleaning, visualizing, and identifying patterns and trends in the dataset.

3. a2_Model_Building_Evaluation.ipynb

This notebook handles the process of building and evaluating machine learning models. It covers tasks like data splitting, model training, hyperparameter tuning, and performance evaluation.

Requirements

Python 3.x
Jupyter Notebook
Required libraries: Pandas, NumPy, Matplotlib, Seaborn, Scikit-learn

Usage

Clone the repository.
Install dependencies using pip install -r requirements.txt.
Open each notebook and execute the cells in order.

Author

Sean Seunghyun Kim

Project Overview

This project includes three Jupyter notebooks designed for data analysis, model building, and evaluation tasks. The structure and purpose of each notebook are detailed below:

Notebook Summaries

1. a0_Utility_Functions.ipynb

This notebook contains frequently used functions organized as utility macros. These functions can be reused across different projects to streamline tasks such as data preprocessing, visualization, and performance evaluation.

2. a1_Exploratory_Analysis.ipynb

This notebook focuses on exploratory data analysis (EDA). It includes steps for loading data, cleaning, visualizing, and identifying patterns and trends in the dataset.

3. a2_Model_Building_Evaluation.ipynb

This notebook handles the process of building and evaluating machine learning models. It covers tasks like data splitting, model training, hyperparameter tuning, and performance evaluation.

Requirements

Python 3.x
Jupyter Notebook
Required libraries: Pandas, NumPy, Matplotlib, Seaborn, Scikit-learn

Usage

Clone the repository.
Install dependencies using pip install -r requirements.txt.
Open each notebook and execute the cells in order.

Author

Sean Seunghyun Kim

Contact

Email: seunghyk@tepper.cmu.edu Phone: (949) 572 7370

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
a0_utility_functions.py		a0_utility_functions.py
a1_exploratory_analysis.ipynb		a1_exploratory_analysis.ipynb
a2_model_building_evaluation.ipynb		a2_model_building_evaluation.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Project Overview

Notebook Summaries

1. a0_Utility_Functions.ipynb

2. a1_Exploratory_Analysis.ipynb

3. a2_Model_Building_Evaluation.ipynb

Requirements

Usage

Author

Project Overview

Notebook Summaries

1. a0_Utility_Functions.ipynb

2. a1_Exploratory_Analysis.ipynb

3. a2_Model_Building_Evaluation.ipynb

Requirements

Usage

Author

Contact

About

Uh oh!

Releases

Packages

Languages

seankim0/fraud_detection

Folders and files

Latest commit

History

Repository files navigation

Project Overview

Notebook Summaries

1. a0_Utility_Functions.ipynb

2. a1_Exploratory_Analysis.ipynb

3. a2_Model_Building_Evaluation.ipynb

Requirements

Usage

Author

Project Overview

Notebook Summaries

1. a0_Utility_Functions.ipynb

2. a1_Exploratory_Analysis.ipynb

3. a2_Model_Building_Evaluation.ipynb

Requirements

Usage

Author

Contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages