🏡 Multiple Linear Regression – California Housing Dataset

This project demonstrates Multiple Linear Regression on the California Housing dataset from sklearn.datasets. It explores feature relationships, evaluates model performance with several metrics, and serializes the trained model with pickle in preparation for deployment.

📌 Project Overview

Multiple Linear Regression helps us model the relationship between one dependent variable (target) and multiple independent variables (features). In this project, we aim to predict housing prices based on various features from the California Housing dataset.

🗃️ Dataset Details

📦 Source: sklearn.datasets.fetch_california_housing
🧮 Samples: 20,640
🔢 Features: 8 numerical features
🎯 Target: Price (median house value, expressed in units of $100,000)
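
As a minimal sketch of how a dataset with these details might be turned into a single DataFrame with a `Price` column: the helper name `bunch_to_frame` is hypothetical, and the tiny stand-in object below uses made-up numbers so the example runs without downloading anything.

```python
import numpy as np
import pandas as pd
from sklearn.utils import Bunch


def bunch_to_frame(bunch, target_name="Price"):
    """Build one DataFrame with a column per feature plus the target."""
    df = pd.DataFrame(bunch.data, columns=bunch.feature_names)
    df[target_name] = bunch.target
    return df


# Stand-in with the same attributes as the object returned by
# fetch_california_housing(). Values are illustrative, not real data.
demo = Bunch(
    data=np.array([[1.0, 10.0], [2.0, 20.0]]),
    feature_names=["MedInc", "HouseAge"],
    target=np.array([0.5, 1.5]),
)
df = bunch_to_frame(demo)
print(df.head())
```

With network access, the real data could be loaded the same way: import `fetch_california_housing` from `sklearn.datasets` and call `bunch_to_frame(fetch_california_housing())`.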

🧠 Workflow

Load Dataset & Create DataFrame
- Loaded using fetch_california_housing()
- Converted to a pandas DataFrame
Exploratory Data Analysis
- Used seaborn.pairplot() to visualize relationships
- Created a heatmap to observe feature correlations
Data Preparation
- Split into train/test sets using train_test_split
- Standardized features using StandardScaler
Model Building
- Trained a Multiple Linear Regression model using LinearRegression from scikit-learn
Model Evaluation
- Evaluated with:
  - Mean Squared Error (MSE)
  - Mean Absolute Error (MAE)
  - Root Mean Squared Error (RMSE)
  - R² Score
  - Adjusted R² Score
Assumptions & Residual Analysis
- Plotted residuals using:
  - seaborn.distplot() to check normality (deprecated since seaborn 0.11; histplot() or displot() are the current equivalents)
  - Scatter plot of residuals vs predictions to check homoscedasticity
- Found that the fit was not optimal, leaving room to improve model accuracy
Model Deployment Prep
- Exported the trained model using Pickling (pickle.dump)
- Discussed its usage in cloud-based inference pipelines
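
The preparation, training, and evaluation steps above can be sketched roughly as follows. This is an illustration, not the notebook's exact code: the function name `fit_and_evaluate`, the `test_size=0.3` split, and `random_state=42` are assumptions, and synthetic data stands in for the housing features so the sketch runs offline.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler


def fit_and_evaluate(X, y, test_size=0.3, random_state=42):
    """Split, standardize, fit LinearRegression, and report test metrics."""
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=test_size, random_state=random_state
    )
    # Fit the scaler on the training split only, then reuse it on the test split.
    scaler = StandardScaler()
    X_train_s = scaler.fit_transform(X_train)
    X_test_s = scaler.transform(X_test)

    model = LinearRegression().fit(X_train_s, y_train)
    y_pred = model.predict(X_test_s)

    mse = mean_squared_error(y_test, y_pred)
    metrics = {
        "mse": mse,
        "mae": mean_absolute_error(y_test, y_pred),
        "rmse": float(np.sqrt(mse)),
        "r2": r2_score(y_test, y_pred),
    }
    # Residuals feed the normality and homoscedasticity checks.
    residuals = y_test - y_pred
    return model, scaler, metrics, residuals


# Synthetic stand-in data (8 features, linear signal plus small noise).
rng = np.random.default_rng(0)
X_demo = rng.normal(size=(500, 8))
y_demo = X_demo @ np.arange(1, 9, dtype=float) + rng.normal(scale=0.1, size=500)
model, scaler, metrics, residuals = fit_and_evaluate(X_demo, y_demo)
print(metrics)
```

On the real data, the arrays from `fetch_california_housing(return_X_y=True)` could be passed in instead. The returned residuals can then be plotted as a histogram and against the predictions.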

📊 Libraries Used

| Library | Purpose |
|---|---|
| `pandas` | Data handling |
| `numpy` | Numerical computation |
| `seaborn` | Visualization (pairplot, heatmap) |
| `matplotlib` | Plotting |
| `sklearn` | Dataset loading, ML models, metrics |
| `pickle` | Model serialization |

📈 Metrics Used

📉 MSE – Mean Squared Error
📉 MAE – Mean Absolute Error
📉 RMSE – Root Mean Squared Error
📈 R² Score – Goodness of fit
📈 Adjusted R² – R² adjusted for number of features
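
To make the definitions concrete, here is a small standard-library sketch that computes all five metrics by hand. The function name `regression_metrics` and the toy numbers are illustrative only. Adjusted R² uses the usual formula 1 - (1 - R²)(n - 1)/(n - p - 1), where n is the number of samples and p the number of features.

```python
import math


def regression_metrics(y_true, y_pred, n_features):
    """Compute MSE, MAE, RMSE, R² and adjusted R² from paired values."""
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]

    mse = sum(e * e for e in errors) / n
    mae = sum(abs(e) for e in errors) / n
    rmse = math.sqrt(mse)

    mean_y = sum(y_true) / n
    ss_res = sum(e * e for e in errors)              # residual sum of squares
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)  # total sum of squares
    r2 = 1 - ss_res / ss_tot

    # Penalize R² for the number of features used.
    adj_r2 = 1 - (1 - r2) * (n - 1) / (n - n_features - 1)
    return {"mse": mse, "mae": mae, "rmse": rmse, "r2": r2, "adj_r2": adj_r2}


scores = regression_metrics(
    y_true=[3.0, 5.0, 7.0, 9.0],
    y_pred=[2.5, 5.5, 7.0, 9.5],
    n_features=1,
)
print(scores)
```

For this toy input the MSE is 0.1875, the MAE is 0.375, R² is 0.9625, and adjusted R² is 0.94375, which shows the adjustment always pulling the score at or below plain R².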

🗃️ Project Structure

| File Name | Description |
|---|---|
| `Multiple_Linear_Regression.ipynb` | Full model implementation and evaluation |
| `README.md` | Project documentation (this file) |
| `model.pkl` | Serialized (pickled) trained model |

🚀 How to Run the Project

Clone the Repository

```
git clone https://github.com/YourUsername/Multiple-Linear-Regression-California.git
cd Multiple-Linear-Regression-California
```

Install required libraries

```
pip install pandas numpy matplotlib seaborn scikit-learn notebook
```

Launch Jupyter Notebook
```
jupyter notebook
```
Open `Multiple_Linear_Regression.ipynb` and run the cells in order.

☁️ Model Deployment Tip

To deploy this model on the cloud:

- Load the model.pkl file in your API/backend
- Serve it with a framework such as Flask or FastAPI, or a cloud service such as AWS Lambda or Azure Functions
- Standardize incoming data with the same StandardScaler that was fitted on the training data (call transform, not fit_transform)

Perform prediction using:

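```python
import pickle

with open("model.pkl", "rb") as f:  # context manager closes the file
    model = pickle.load(f)
# new_scaled_data: shape (n_samples, 8), already scaled as in training
prediction = model.predict(new_scaled_data)
```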
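
One possible way to keep the scaling step and the model in sync at inference time is to bundle them in a scikit-learn `Pipeline` and pickle that single object. This is a sketch of an alternative, not what the notebook does: the data is synthetic, and the in-memory `pickle.dumps`/`pickle.loads` round trip stands in for writing and reading a file.

```python
import pickle

import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the eight housing features (not the real dataset).
rng = np.random.default_rng(0)
X_train = rng.normal(loc=5.0, scale=2.0, size=(200, 8))
y_train = X_train.sum(axis=1)

# One artifact carries both the fitted scaler and the fitted model.
pipeline = make_pipeline(StandardScaler(), LinearRegression())
pipeline.fit(X_train, y_train)

payload = pickle.dumps(pipeline)   # bytes that pickle.dump would write to disk
restored = pickle.loads(payload)   # what the API/backend would load

raw_request = X_train[:3]          # unscaled input, as a client would send it
predictions = restored.predict(raw_request)
print(predictions)
```

With this layout the backend passes raw feature values straight to `predict`, and the pipeline applies the training-time scaling itself. Whichever layout is used, only unpickle files from a trusted source, since loading a pickle can execute arbitrary code.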

👩‍💻 Author

Maitri Prabhu

GitHub: Mai3Prabhu
