This application scrapes the data from www.indeed.co.in based on the input that is configured under input folder. The scraped data is used for the following -
- Data is cleaned for duplicates and salary is converted to integer
- Visualize the top 10 job titles that are in demand
- Determine which skill sets are sought more
The input to be used for scraping is provided under input folder with file name job_search_input.json. The file has the following attributes in the form of the list -
- field - Field that is sought by you
- city - City of interest
- state - State of interest
- skills_needed - Skills that are to be searched upon
A cron job will be run on the analyse_indeed.py file to scrape the data and form a data frame to the scrapped_data folder. Another cron job will be run on map_skills.py which will clean the dataset and save the visual data in plots folder.