According to steps 1 and 2 (Business Understanding and Data Understanding) from the CRISP-DM, questions were proposed according to employment and salary factors to take into account when getting hired or seeking a job as a developer.
- Installation
- Project Explanation
- Project Files
- Results
- Licences and Acknowledgements
For getting ready to run the code with the analysis, it's necessary to have the dependencies downloaded, for it, you'll have to:
- cd to the directory where requirements.txt is located.
- activate your virtualenv.
- run: pip install -r requirements.txt in your prompt.
Udacity Data Science nanodegree project for developing analysis abilities with communicative and problem-solution purposes using real industry data, python and dependencies for data wrangling as pandas. CRISP-DM was the main tool or getting the job done as it is a way of organizing and having the main purposes clear.
Main files of the project are:
- notebook_data_analysis.ipynb (Jupyter Notebook with the process of wrangling data)
- requirements.txt (File for installing dependencies on your environment)
- README_2020.txt (Licence of StackOverflow's survey results)
- survey_results_public.csv (CSV containing all data rfom survey's responses)
- survey_results_schema.csv (CVS containing questions according to each column on the data from survey_results_public.csv)
For checking up what the final analysis textually made was, visit my Medium post here.
StackOverflow's licence and dataset used in the link here and also terms of use in the README_2020.txt file mentioned 2 sections before.
Code, analysis and writing made by Juan David Pisco Jaimes.