This project monitors the performance of a fraud detection machine learning model using NannyML library to detect model drift and performance degradation.
Banks use machine learning models to detect fraudulent transactions, but changing data patterns can weaken these defenses. This project helps identify:
- Performance alerts in model accuracy
- Feature drift detection
- Unusual transaction patterns
- reference.csv: Historical test data used to establish baseline model performance
- analysis.csv: Production data for monitoring model performance over time
timestamp
: Date of the transactiontime_since_login_min
: Time since user logged intransaction_amount
: Amount in Pounds (£)transaction_type
: CASH-OUT, PAYMENT, CASH-IN, TRANSFERis_first_transaction
: Binary indicator for first transactionuser_tenure_months
: Account age in monthsis_fraud
: Ground truth fraud labelpredicted_fraud_proba
: Model prediction probabilitypredicted_fraud
: Model prediction (binary)
- Performance Alerts: [April 2019, May 2019, June 2019]
- Most Drifted Feature: time_since_login_min
- Root Cause: User login patterns changed significantly, affecting model accuracy
pip install -r requirements.txt