📦 End-to-End MLOps Pipeline with ZenML, MLflow, VS Code, and WSL on Windows

This project demonstrates how to develop, evaluate, and deploy a machine learning model with a fully automated MLOps pipeline built on ZenML and MLflow. The model predicts house prices from the input features it was trained on. The setup targets Windows 11, using VS Code Remote-WSL and Ubuntu under WSL 2.

The pipeline automates data ingestion ➔ preprocessing ➔ model training ➔ evaluation ➔ deployment, with artifact tracking and service orchestration.

🎥 Project Demo Video

Watch the walkthrough of this project on LinkedIn:

🛠️ Project Structure

House-prices-predict-MLOps/
├── .venv/                # Virtual environment
├── .zen/                 # ZenML metadata
├── analysis/             # Exploratory data analysis
├── data/                 # Source of data
├── design_patterns/      # Reusable MLOps design patterns
├── extracted_data/       # Extracted raw dataset(s)
├── mlruns/               # MLflow experiment tracking
├── pipelines/            # ZenML pipelines definitions
├── src/                  # Core code (feature engineering, model building, etc.)
├── steps/                # Individual ZenML steps
├── tests/                # Unit and integration tests
├── config.yaml           # Project configuration file
├── README.md             # Project documentation
├── requirements.txt      # Project dependencies
├── run_pipeline.py       # Script to run the training pipeline
├── run_deployment.py     # Script to deploy the best model
└── sample_predict.py     # Script to test prediction after deployment
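
To illustrate what a post-deployment test such as `sample_predict.py` might do, here is a minimal sketch that builds a scoring request for an MLflow model server. It assumes the MLflow 2.x `dataframe_split` input format; the feature names and the endpoint URL are hypothetical placeholders, not values taken from this repository.

```python
import json
import urllib.request


def build_payload(columns, rows):
    """Build a request body in MLflow 2.x 'dataframe_split' format."""
    return {
        "dataframe_split": {
            "columns": list(columns),
            "data": [list(row) for row in rows],
        }
    }


def predict(url, payload):
    """POST the payload as JSON to a scoring endpoint and return the parsed reply."""
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))


# Hypothetical feature names and values, for illustration only.
example_payload = build_payload(
    columns=["Gr Liv Area", "Overall Qual"],
    rows=[(1500, 7)],
)
print(json.dumps(example_payload, indent=2))

# Hypothetical endpoint; use the prediction URL reported by your deployment:
# print(predict("http://127.0.0.1:8000/invocations", example_payload))
```

The network call is left commented out so the sketch runs without a deployed model; the real script may send additional or differently named features.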

⚙️ How to Set Up the Project

Follow these steps carefully to reproduce the MLOps environment and workflow.

1. Install WSL and Ubuntu on Windows 11

Open PowerShell (as Admin):
```
wsl --install
```
Ensure you have Ubuntu 22.04 LTS installed.
Verify WSL version:
```
wsl --list --verbose
```
Ensure it's WSL 2.
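
From inside the Ubuntu shell you can also confirm that you are running under WSL. The sketch below is a heuristic, assuming the kernel release string in `/proc/version` contains "microsoft" under WSL and "WSL2" under WSL 2, which is the usual case but is not guaranteed for custom kernels. The function names are illustrative.

```python
from pathlib import Path


def is_wsl(version_text: str) -> bool:
    """Return True if a /proc/version string looks like it comes from WSL."""
    return "microsoft" in version_text.lower()


def is_wsl2(version_text: str) -> bool:
    """Heuristic: WSL 2 kernels usually carry 'WSL2' in the release string."""
    return is_wsl(version_text) and "wsl2" in version_text.lower()


if __name__ == "__main__":
    proc_version = Path("/proc/version")
    text = proc_version.read_text() if proc_version.exists() else ""
    print("Running inside WSL:", is_wsl(text))
    print("Kernel looks like WSL 2:", is_wsl2(text))
```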

2. Set Up VS Code Remote WSL

Install the Remote - WSL extension in VS Code, then open the project in one of two ways:
- From a WSL terminal (Start Menu > type WSL > open): go to your project folder/directory (e.g., cd mct/F/AI/MLOps/prices-predict-mlops) and run the command: code .
- From VS Code: press Ctrl+Shift+P and select "Remote-WSL: New Window".

3. Prepare Python Environment Inside WSL

Update and install essentials:

  sudo apt update
  sudo apt install python3.10 python3.10-venv python3-pip

4. Clone This Repository

Inside WSL terminal:

git clone https://github.com/razyousuf/MLOps.git
cd MLOps

5. Create a Virtual Environment

    python3.10 -m venv .venv
    source .venv/bin/activate

6. Install Python Dependencies

    pip install --upgrade pip
    pip install -r requirements.txt

Note: If you see an externally-managed-environment error, make sure your virtual environment is activated before running pip.
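
A quick way to confirm that the interpreter you are using belongs to a virtual environment is to compare `sys.prefix` with `sys.base_prefix`; they differ inside a venv. This is a small sketch, and the helper name `in_virtualenv` is only illustrative.

```python
import sys


def in_virtualenv(prefix: str, base_prefix: str) -> bool:
    """A venv interpreter reports a prefix different from its base installation."""
    return prefix != base_prefix


if __name__ == "__main__":
    if in_virtualenv(sys.prefix, sys.base_prefix):
        print(f"Virtual environment active: {sys.prefix}")
    else:
        print("No virtual environment active; run `source .venv/bin/activate` first.")
```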

7. Initialize and Set Up ZenML

zenml init

This creates the .zen directory and a local ZenML repository.

8. ZenML Stack Configuration

This project requires a ZenML stack with MLflow components. Here's how to set it up:

a. Install Required Integrations:

zenml integration install mlflow -y

b. Register MLflow Components:

# Register MLflow Experiment Tracker
zenml experiment-tracker register mlflow_tracker --flavor=mlflow

# Register MLflow Model Deployer
zenml model-deployer register mlflow --flavor=mlflow

c. Create and Activate Stack:

# -a: Artifact Store, -o: Orchestrator, -d: Model Deployer, -e: Experiment Tracker
# --set activates the stack immediately
zenml stack register local-mlflow-stack \
  -a default \
  -o default \
  -d mlflow \
  -e mlflow_tracker \
  --set

d. Verify Your Stack:

zenml stack describe

9. Start the ZenML Server

zenml up --blocking

Or manually register your custom stack if needed.

10. Execute the Pipeline

python run_pipeline.py

🧩 Steps Directory Breakdown

The pipelines are built from the steps listed below. Each step is a ZenML @step that encapsulates a single stage of the pipeline:

| Step File | Purpose |
| --- | --- |
| `data_ingestion_step.py` | Load raw dataset into memory |
| `data_splitter_step.py` | Split dataset into training/validation/test sets |
| `dynamic_importer.py` | Dynamically import external datasets during deployment |
| `feature_engineering_step.py` | Apply feature transformations (encoding, scaling) |
| `handle_missing_values_step.py` | Impute or handle missing data |
| `model_building_step.py` | Train ML models (XGBoost, RandomForest) |
| `model_evaluator_step.py` | Evaluate model performance |
| `model_loader.py` | Load pre-trained models during deployment |
| `outlier_detection_step.py` | Detect and handle data outliers |
| `prediction_service_loader.py` | Load prediction service endpoint |
| `predictor.py` | Send inference requests to deployed model |
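
To make the flow between these stages concrete, here is a minimal sketch in plain Python. It is not the repository's code: the toy data, the function bodies, and the one-feature least-squares model are assumptions chosen so the example runs without ZenML, XGBoost, or scikit-learn. In the project, each function would be a ZenML @step and the chaining function a pipeline.

```python
from statistics import mean

# Hypothetical toy data (living area, sale price); not the project dataset.
RAW_ROWS = [
    {"area": 1000.0, "price": 200000.0},
    {"area": 1500.0, "price": 300000.0},
    {"area": None, "price": 250000.0},  # missing feature value
    {"area": 2000.0, "price": 400000.0},
    {"area": 2500.0, "price": 500000.0},
]


def ingest_data():
    """Stand-in for data_ingestion_step: return the raw rows."""
    return [dict(row) for row in RAW_ROWS]


def handle_missing_values(rows):
    """Stand-in for handle_missing_values_step: mean-impute missing areas."""
    fill = mean(r["area"] for r in rows if r["area"] is not None)
    return [{**r, "area": fill if r["area"] is None else r["area"]} for r in rows]


def split_data(rows, test_size=1):
    """Stand-in for data_splitter_step: hold out the last `test_size` rows."""
    return rows[:-test_size], rows[-test_size:]


def build_model(train_rows):
    """Stand-in for model_building_step: fit a one-feature least-squares line."""
    xs = [r["area"] for r in train_rows]
    ys = [r["price"] for r in train_rows]
    x_bar, y_bar = mean(xs), mean(ys)
    slope = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / sum(
        (x - x_bar) ** 2 for x in xs
    )
    return {"slope": slope, "intercept": y_bar - slope * x_bar}


def evaluate_model(model, test_rows):
    """Stand-in for model_evaluator_step: mean squared error on held-out rows."""
    return mean(
        (model["intercept"] + model["slope"] * r["area"] - r["price"]) ** 2
        for r in test_rows
    )


def training_pipeline():
    """Chain the stages: ingest, clean, split, train, evaluate."""
    rows = handle_missing_values(ingest_data())
    train_rows, test_rows = split_data(rows)
    model = build_model(train_rows)
    return model, evaluate_model(model, test_rows)


if __name__ == "__main__":
    fitted_model, mse = training_pipeline()
    print("model:", fitted_model)
    print("held-out MSE:", mse)
```

Because every stage only consumes the previous stage's output, any one of them can be replaced (for example, a different imputation strategy) without touching the others.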

🔵 Key Features

Modular Design: Each step is reusable and independent
Flexible Swapping:
- Change models easily
- Plug in different feature pipelines
- Hot-swap deployment methods
MLOps Best Practices:
- Caching enabled for all steps
- Automatic experiment tracking
- Model versioning

Pipelines automatically log parameters, metrics, and artifacts to MLflow, where they can be inspected in the MLflow UI.

11. Access the MLflow Tracking Server

After running the pipeline:

mlflow ui

Navigate to: http://127.0.0.1:5000
View experiments, parameters, models, and deployment statuses.
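
If the page does not load, a quick check is whether anything is listening on the port at all. The sketch below uses only the standard library and assumes the default address used above (127.0.0.1, port 5000); the helper name `is_port_open` is illustrative.

```python
import socket


def is_port_open(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within `timeout`."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    if is_port_open("127.0.0.1", 5000):
        print("Something is listening on port 5000; open http://127.0.0.1:5000")
    else:
        print("Nothing is listening on port 5000; start `mlflow ui` inside WSL first.")
```

A successful connection only shows that some process holds the port; it does not prove that the process is MLflow.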

❗ Common Issues and Solutions

| Problem | Solution |
| --- | --- |
| `Daemon functionality is currently not supported on Windows` | Use WSL and run everything inside the WSL Linux environment. |
| `externally-managed-environment` error during pip install | Always activate the virtual environment created inside WSL (`source .venv/bin/activate`). |
| VS Code switches back to Windows when opening folders | Use "Open Folder" inside VS Code's Remote-WSL window. Do not open the folder from Windows File Explorer. |
| MLflow server not accessible | Ensure port 5000 (MLflow) is open and that you are running MLflow from inside WSL. |

📋 Key Tools and Technologies

ZenML — MLOps orchestration and pipeline automation
MLflow — Experiment tracking and model registry
VS Code — Lightweight IDE with WSL remote development
WSL 2 (Ubuntu 24.04.1) — Linux environment on Windows 11
Python 3.10.17 — Programming language for ML pipeline
Scikit-learn — Machine Learning library

🌟 What You Will Learn

How to fully automate an ML project lifecycle: from development to deployment.
How to structure your ML codebase for MLOps.
How to properly set up Python, ZenML, and MLflow in Windows using WSL.
How to track experiments, models, and metrics through MLflow.
