Skip to content

seankim0/fraud_detection

Repository files navigation

Project Overview

Note: This is a prototype and not the actual code used in production.

This MVP fraud detection model serves as a benchmark for evaluating the performance of various machine learning algorithms, including XGBoost, CatBoost, Random Forest, Neural Network, and Logistic Regression. The project comprises three Jupyter notebooks dedicated to data analysis, model development, and performance evaluation. The structure and purpose of each notebook are outlined below:

Notebook Summaries

1. a0_Utility_Functions.ipynb

This notebook contains frequently used functions organized as utility macros. These functions can be reused across different projects to streamline tasks such as data preprocessing, visualization, and performance evaluation.

2. a1_Exploratory_Analysis.ipynb

This notebook focuses on exploratory data analysis (EDA). It includes steps for loading data, cleaning, visualizing, and identifying patterns and trends in the dataset.

3. a2_Model_Building_Evaluation.ipynb

This notebook handles the process of building and evaluating machine learning models. It covers tasks like data splitting, model training, hyperparameter tuning, and performance evaluation.


Requirements

  • Python 3.x
  • Jupyter Notebook
  • Required libraries: Pandas, NumPy, Matplotlib, Seaborn, Scikit-learn

Usage

  1. Clone the repository.
  2. Install dependencies using pip install -r requirements.txt.
  3. Open each notebook and execute the cells in order.

Author

Sean Seunghyun Kim

Project Overview

This project includes three Jupyter notebooks designed for data analysis, model building, and evaluation tasks. The structure and purpose of each notebook are detailed below:

Notebook Summaries

1. a0_Utility_Functions.ipynb

This notebook contains frequently used functions organized as utility macros. These functions can be reused across different projects to streamline tasks such as data preprocessing, visualization, and performance evaluation.

2. a1_Exploratory_Analysis.ipynb

This notebook focuses on exploratory data analysis (EDA). It includes steps for loading data, cleaning, visualizing, and identifying patterns and trends in the dataset.

3. a2_Model_Building_Evaluation.ipynb

This notebook handles the process of building and evaluating machine learning models. It covers tasks like data splitting, model training, hyperparameter tuning, and performance evaluation.


Requirements

  • Python 3.x
  • Jupyter Notebook
  • Required libraries: Pandas, NumPy, Matplotlib, Seaborn, Scikit-learn

Usage

  1. Clone the repository.
  2. Install dependencies using pip install -r requirements.txt.
  3. Open each notebook and execute the cells in order.

Author

Sean Seunghyun Kim

Contact

Email: seunghyk@tepper.cmu.edu Phone: (949) 572 7370

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published