DLBDSMLUSL01 - Unsupervised Machine Learning and Feature Engineering

Task 1: Mental Health in Technology-related jobs

Goal:

The goal of this project is to analyze a large dataset containing information about mental health in technology-related companies. The analysis uses unsupervised machine learning techniques to group similar participants into clusters. Once the clusters are formed, visualizations of the clusters and their profiles provide a deeper understanding of the main principles of the data set.

Project Format

The project is split into three notebooks to improve readability and separate the stages of analysis.

1. Data Cleaning

data_cleaning = 'Task1DataCleaning.ipynb'

2. Encoding, Scaling, PCA, and K-Means

data_analysis = 'Task1DataAnalysis.ipynb'

3. Feature Importance

data_feature_importance = 'Task1FeatureImportance.ipynb'
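
The second stage (encoding, scaling, PCA, and K-Means) can be illustrated with a minimal sketch. The notebook's actual code is not reproduced here, so everything below is an assumption: the function name `cluster_participants`, the toy survey rows, the choice of one-hot encoding and standard scaling, and the values of `n_components` and `n_clusters` are all hypothetical and only show how the four steps fit together with scikit-learn.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA
from sklearn.preprocessing import OneHotEncoder, StandardScaler


def cluster_participants(answers, n_components=2, n_clusters=3, seed=0):
    """Encode categorical answers, scale, reduce with PCA, cluster with K-Means."""
    # One-hot encode each categorical column and convert to a dense array.
    encoded = OneHotEncoder(handle_unknown="ignore").fit_transform(answers).toarray()
    # Standardize every encoded column to zero mean and unit variance.
    scaled = StandardScaler().fit_transform(encoded)
    # Project the scaled data onto the first principal components.
    reduced = PCA(n_components=n_components, random_state=seed).fit_transform(scaled)
    # Assign each participant to one of n_clusters clusters.
    labels = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed).fit_predict(reduced)
    return reduced, labels


# Hypothetical toy rows, NOT taken from the project's data set:
# (works remotely, employer offers benefits, sought treatment).
toy_answers = np.array(
    [
        ["yes", "yes", "yes"],
        ["yes", "yes", "no"],
        ["yes", "no", "yes"],
        ["no", "no", "no"],
        ["no", "no", "yes"],
        ["no", "yes", "no"],
    ],
    dtype=object,
)

reduced, labels = cluster_participants(toy_answers, n_clusters=2)
```

Here `reduced` holds one row of principal-component coordinates per participant, which is the kind of array a cluster scatter plot would use, and `labels` holds the cluster index assigned to each participant.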

For best results, if you download the entire folder, run the notebooks with 'run_notebooks.py'.

In your terminal enter:

python run_notebooks.py

This will ensure all notebooks are run in sequential order.
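
The contents of 'run_notebooks.py' are not shown in this README. As a rough sketch of how such a script could run the notebooks in order, the example below assumes execution through the `jupyter nbconvert` command line; the names `NOTEBOOKS`, `build_command`, and `run_all` are hypothetical and may not match the real script.

```python
import subprocess

# Notebook order as listed under "Project Format".
NOTEBOOKS = [
    "Task1DataCleaning.ipynb",
    "Task1DataAnalysis.ipynb",
    "Task1FeatureImportance.ipynb",
]


def build_command(notebook):
    """Return the nbconvert command that executes one notebook in place."""
    return [
        "jupyter", "nbconvert",
        "--to", "notebook",
        "--execute",
        "--inplace",
        notebook,
    ]


def run_all(notebooks=NOTEBOOKS, runner=subprocess.run):
    """Execute each notebook in sequence, stopping at the first failure."""
    for notebook in notebooks:
        # check=True raises CalledProcessError if a notebook fails,
        # so later notebooks never run on missing or stale inputs.
        runner(build_command(notebook), check=True)
```

Calling `run_all()` would execute the three notebooks one after another. Stopping on the first failure matters here because each stage presumably depends on the output of the previous one.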

For required packages please see 'requirements.txt'

Repository: AlexandraFaria/UnsupervisedLearningTask1Project
