XGBoost Inference Pipeline for Heart Disease UCI

Project Overview

This project implements an inference pipeline for heart disease prediction using a pre-trained XGBoost model. It provides functionality for loading the model, preprocessing new data, and making predictions.

Project Structure

gillopy-Deployment_XGBoost_Inference_Heart_Disease_UCI/
├── README.md                    # Project documentation
├── Dockerfile                   # Docker container configuration
├── LICENSE                      # Project license
├── pyproject.toml              # Poetry dependency management
├── .dockerignore               # Docker build exclusions
├── models/                     # Pre-trained model files
│   ├── trained_model_2025-01-06.joblib
│   └── trained_model_2025-01-08.joblib
├── src/                        # Source code
│   ├── data_preprocessor.py    # Data preprocessing functionality
│   ├── inference.py           # Main inference pipeline
│   └── model_loader.py        # Model loading utilities
└── tests/                      # Test suite
    ├── __init__.py
    ├── test_data_preprocessor.py
    ├── test_inference.py
    └── test_model_loader.py

Requirements

Python Version

Python 3.10 (specific version requirement)

Dependencies

All dependencies are managed through Poetry and specified in pyproject.toml:

pandas (^2.2.3)
scikit-learn (^1.6.0)
xgboost (^2.1.3)
joblib (^1.4.2)

Setup and Installation

Clone the Repository:

git clone https://github.com/gillopy/Deployment_XGBoost_Inference_Heart_Disease_UCI
cd Deployment_XGBoost_Inference_Heart_Disease_UCI

Install Dependencies:
```
poetry install
```

Docker Setup (optional):

docker build -t heart-disease-inference .

Usage

Running Inference

Execute the main inference script:

poetry run python src/inference.py

Input Data Format

The model expects input data in the following format:

{
    "age": int,
    "sex": int,
    "cp": int,
    "trestbps": int,
    "chol": int,
    "fbs": int,
    "restecg": int,
    "thalach": int,
    "exang": int,
    "oldpeak": float,
    "slope": int,
    "ca": float,
    "thal": float
}

Testing

Run the test suite using pytest:

poetry run pytest

The test suite includes:

Data preprocessing validation
Inference pipeline testing
Model loading verification

Project Components

Data Preprocessor

Handles missing value imputation
Converts input dictionary to DataFrame format
Implements data validation checks

Model Loader

Loads the pre-trained XGBoost model
Includes error handling for missing model files
Validates model compatibility

Inference Pipeline

Orchestrates the complete inference process
Supports batch predictions
Provides formatted output

Docker Support

The project includes Docker support for containerized deployment:

Base Python 3.10 image
Automatic dependency installation
Environment isolation

License

Apache License 2.0

Author

Guillermo (guillermocabrera9710@gmail.com)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

XGBoost Inference Pipeline for Heart Disease UCI

Project Overview

Project Structure

Requirements

Python Version

Dependencies

Setup and Installation

Usage

Running Inference

Input Data Format

Testing

Project Components

Data Preprocessor

Model Loader

Inference Pipeline

Docker Support

License

Author

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
models		models
src		src
tests		tests
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

License

gillopy/Deployment_XGBoost_Inference_Heart_Disease_UCI

Folders and files

Latest commit

History

Repository files navigation

XGBoost Inference Pipeline for Heart Disease UCI

Project Overview

Project Structure

Requirements

Python Version

Dependencies

Setup and Installation

Usage

Running Inference

Input Data Format

Testing

Project Components

Data Preprocessor

Model Loader

Inference Pipeline

Docker Support

License

Author

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages