The goal of this project is to analyze a large dataset containing information about mental health in technology-related companies. The evaluation requires unsupervised machine learning techniques to form clusters of similar participants. Once the clusters are formed, visualizations of the clusters and their profiles are used to provide a deeper understanding of the main principles of the dataset.
Three notebooks are used to improve readability and to separate the stages of analysis:
data_cleaning = 'Task1DataCleaning.ipynb'
data_analysis = 'Task1DataAnalysis.ipynb'
data_feature_importance = 'Task1FeatureImportance.ipynb'
To run the project, enter the following in your terminal:
python run_notebooks.py
This runs all three notebooks in sequential order: data cleaning, then data analysis, then feature importance.
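The contents of run_notebooks.py are not shown here, so the following is only a minimal sketch of how such a script could execute the notebooks in order. It assumes Jupyter and nbconvert are installed and that the notebooks sit in the current directory; the names build_command, run_all, and runner are hypothetical and chosen for illustration.

```python
import subprocess

# Notebook filenames as listed above, in the order they should run.
NOTEBOOKS = [
    "Task1DataCleaning.ipynb",
    "Task1DataAnalysis.ipynb",
    "Task1FeatureImportance.ipynb",
]


def build_command(notebook):
    """Return an nbconvert command that executes one notebook in place.

    Assumption: the `jupyter nbconvert` CLI is available on the PATH.
    """
    return [
        "jupyter", "nbconvert",
        "--to", "notebook",
        "--execute",
        "--inplace",
        notebook,
    ]


def run_all(notebooks=NOTEBOOKS, runner=subprocess.run):
    """Run each notebook in sequence, stopping at the first failure.

    `runner` is a hypothetical hook so the loop can be exercised
    without actually launching Jupyter.
    """
    for notebook in notebooks:
        # check=True raises CalledProcessError if a notebook fails,
        # so later notebooks never run on incomplete outputs.
        runner(build_command(notebook), check=True)
```

Calling run_all() at the bottom of the script would execute the three notebooks one after another. Stopping at the first failure is a design choice in this sketch: each stage is assumed to depend on the output of the previous one.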