Skip to content

This is a repository to scrape jobs from popular job display platforms like linkedin, weworkremotely and even hr tools like greenhouse, using a combination of web scraping and rest apis

License

Notifications You must be signed in to change notification settings

doobie-droid/job_scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

96 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Contributors Forks Stargazers Issues project_license LinkedIn


Job Scraper

A job scraping repo to automate your job searches!

About The Project

There are a few websites that provide custom solutions to scrape a bunch of job related apis. However, most of these sites/platforms require a subscription of some sorts. The matter becomes worse, when the subscription is paid in dollars and a lot of countries have individuals with credit cards that do not allow dollar payments.

I decided to build a job scraper pooling data from the following SOURCES

Prerequisites

Tool Version
Golang ^1.23.4
MySQL ^5.7
Chromium-Browser 133.0.6943.53

Installation

  1. Install dependencies in go

    go mod tidy
    
  2. Setup Chrome Headless

  3. Run database migrations in mysql

  • Log in to mysql

    mysql -u <username> -p
    
  • Enter your password in the empty screen

  • Run the command

    create database job_scraper;
    
  • Run the migrations file whilst in the mysql shell

    SOURCE ./sql/job_migration.sql
    
  1. Run the application
  • Create your own .env file from the .env.example

     cp .env.example .env
    
  • Start the app

    go run .
    

Running Test

  • Run the test within the package

    go test
    
  • Run all test recursively

    go test ./...
    
  • Run all test recursively with coverage

    go test -coverpkg=./... ./...
    
  • Run test with the verbose flag go test -v

About

This is a repository to scrape jobs from popular job display platforms like linkedin, weworkremotely and even hr tools like greenhouse, using a combination of web scraping and rest apis

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •  

Languages