Skip to content

vignesh96/scrape-indeed-skills

Repository files navigation

Indeed Scraper

This application scrapes the data from www.indeed.co.in based on the input that is configured under input folder. The scraped data is used for the following -

  • Data is cleaned for duplicates and salary is converted to integer
  • Visualize the top 10 job titles that are in demand
  • Determine which skill sets are sought more

Configuration

The input to be used for scraping is provided under input folder with file name job_search_input.json. The file has the following attributes in the form of the list -

  • field - Field that is sought by you
  • city - City of interest
  • state - State of interest
  • skills_needed - Skills that are to be searched upon

A cron job will be run on the analyse_indeed.py file to scrape the data and form a data frame to the scrapped_data folder. Another cron job will be run on map_skills.py which will clean the dataset and save the visual data in plots folder.

About

My collection

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published