Welcome to the Employee Salary Prediction project!
This machine learning model predicts whether an employee's salary is above or below $50K annually, based on their personal and professional attributes.
- ๐ Data Source: UCI Adult Income Dataset
- โ Model Type: Supervised Classification
- ๐ Algorithms: Logistic Regression, Random Forest, Gradient Boosting, SVM, KNN
- ๐ Deployment Option: Streamlit / Flask
- ๐งฉ Output: Predicts if salary is >50K or <=50K
โโโ data/ # Raw dataset
โโโ notebooks/ # EDA & model building notebooks
โโโ requirements.txt
โโโ README.md
Tool/Library | Purpose |
---|---|
Python | Programming Language |
Pandas | Data Manipulation |
NumPy | Numerical Computation |
Scikit-learn | ML Algorithms & Preprocessing |
Matplotlib | Data Visualization |
Joblib | Model Serialization |
Streamlit | Web App Deployment (Optional) |
- Age
- Education Level
- Occupation
- Hours per Week
- Work Experience
- Native Country
- Marital Status
- Salary Label (>50K or <=50K)
-
Clone the repository
bash git clone https://github.com/Naveen-Yerrannagari/Employee-Salary-Prediction-.git cd Employee-Salary-Prediction -
Install the dependencies
bash pip install -r requirements.txt
-
Train the model
bash python src/train_model.py
-
Launch the Streamlit app (Optional)
bash streamlit run app.py
- โ Accuracy: 85%+ (varies by algorithm)
- ๐ Precision, Recall, F1-score: Reported in terminal output
- ๐ Confusion Matrix & Feature Importance: Visualized in notebook
Feature | Value |
---|---|
Age | 34 |
Education | Bachelors |
Occupation | Tech-support |
Hours/Week | 40 |
Experience | 5 |
Prediction | ๐ฐ <=50K |
- ๐ฆ Add advanced models like XGBoost or LightGBM
- ๐งฎ Include regression mode to predict actual salary amount
- ๐ Add country-wise salary normalization
- ๐งพ Improve explainability using SHAP / LIME
- ๐ UCI Machine Learning Repository โ Adult Income Dataset
- ๐ Scikit-learn Documentation
- ๐ Gรฉron, A. (2019). Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow
- ๐ฌ Kaggle Discussions
Contributions, bug reports, and feature requests are welcome! Please open an issue or submit a pull request.
---# Employee-Salary-Prediction- The goal is to predict whether an employee's salary exceeds a specific threshold (e.g., $50,000) based on various personal and professional attributes such as age, education level, occupation, hours worked per week, and years of experience.