A job board, initially built out of the boredom of job hunting. It scans different portals and, based on preferences like keywords, salary, etc., surfaces the jobs that match them.
Most configuration can be set through a `.env` file. All configuration options can be found in `job_board/config.py`.
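As a rough, hypothetical illustration of how the two relate (the variable names below are assumptions, not the project's actual keys), `job_board/config.py` might read values that `.env` provides:

```python
# Hypothetical illustration of config loading; the variable names are
# assumptions, not the project's actual settings.
import os

from dotenv import load_dotenv  # from the python-dotenv package

load_dotenv()  # copies KEY=value pairs from .env into os.environ

ENV = os.environ.get("ENV", "dev")  # e.g. dev / test
DATABASE_URL = os.environ.get("DATABASE_URL", "sqlite:///jobs.db")
KEYWORDS = os.environ.get("KEYWORDS", "python,remote").split(",")
```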
The API endpoint is `/.json`. All filters that are available on the UI are also available via the JSON API.
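As a sketch, querying it from Python might look like this; the host and the filter parameter names are assumptions, not confirmed names:

```python
# Sketch of querying the JSON API on a local dev server. The filter
# parameter names (keyword, min_salary) are assumptions.
import requests

resp = requests.get(
    "http://localhost:8000/.json",
    params={"keyword": "python", "min_salary": 100000},  # hypothetical filters
    timeout=30,
)
resp.raise_for_status()
for job in resp.json():  # assumes the endpoint returns a JSON list of jobs
    print(job)
```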
- Most options should be available using the `--help` flag:

```bash
job-board --help
```

- Running the webserver in debug mode:

```bash
job-board runserver -d
```

- Fetching the jobs immediately:

```bash
job-board fetch
```

- Run it for only specific portals (include these portals):

```bash
job-board fetch -I weworkremotely -I python_dot_org
```

- Run it for all portals, but exclude some (maybe the portal is down, etc.):

```bash
job-board fetch -E wellfound -E work_at_a_startup
```

- Start the job scheduler (runs jobs according to their cron schedules; a generic sketch of a cron-scheduled job follows this list):

```bash
job-board scheduler start
```

- List all registered scheduled jobs:

```bash
job-board scheduler list-jobs
```

- Run a specific job manually:

```bash
job-board scheduler run-job fetch_jobs_daily
```

- Remove all scheduled jobs (useful before deployment):

```bash
job-board scheduler remove-jobs
```
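The scheduler internals aren't documented here, so as a generic, hypothetical illustration only (APScheduler is an assumption, not necessarily what job-board uses), a cron-scheduled job in Python can look like this:

```python
# Hypothetical illustration of a cron-scheduled job. APScheduler is an
# assumption here; it is not necessarily what job-board actually uses.
from apscheduler.schedulers.blocking import BlockingScheduler
from apscheduler.triggers.cron import CronTrigger

scheduler = BlockingScheduler()

# "0 6 * * *" = every day at 06:00; the job id mirrors the CLI example above.
@scheduler.scheduled_job(CronTrigger.from_crontab("0 6 * * *"), id="fetch_jobs_daily")
def fetch_jobs_daily():
    print("fetching jobs from all portals...")

scheduler.start()  # blocks and runs jobs on schedule
```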
```bash
ENV=test pytest
```
```bash
# Install JavaScript dependencies first
npm install

# Run JavaScript tests
npm run test:run

# Run with coverage
npm run test:coverage
```
- Please use a global `gitignore`, rather than adding a `.gitignore` to the repository. A writeup illustrating the reasoning behind this decision: https://sebastiandedeyne.com/setting-up-a-global-gitignore-file/
```bash
pip install -e ".[dev]"
pre-commit install
```
- Development: uses Tailwind CDN (set `ENV=dev`)
- Production: uses optimized local CSS
- Pre-commit hooks auto-build CSS when templates change
- Manual build:

```bash
bash scripts/build-tailwind.sh
```
The text below is mostly written as a note to future me, in the hope that it helps with debugging in case of an issue.
- Although they have a public RSS feed, for some reason they seem to be using some sort of Cloudflare protection that blocks HTTP requests from scripts.
- So scrapfly is used to bypass it (see the scrapfly sketch further below).
- Although they have an API for fetching jobs, the data is pretty unstructured.
- They have a public RSS feed, so the integration is mostly straightforward.
- They have a public API, so the integration is straightforward.
- They have special mechanisms set up to stop scripts from scraping their website.
- So scrapfly, along with its ASP (Anti Scraping Protection) feature, is used to bypass them (a sketch follows this list).
- Although this works, it makes the whole integration very slow, since it takes close to 50-200 seconds to scrape a single page.
- The total number of pages to scrape is around 20-40.
- So yeah, a better alternative that reliably works faster is welcome.
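For reference, a minimal sketch of what the scrapfly call with ASP enabled can look like, based on the scrapfly-sdk's documented usage; the API key and the portal URL are placeholders:

```python
# Sketch of fetching a protected page through scrapfly with ASP enabled.
# The key and URL are placeholders, not real values.
from scrapfly import ScrapeConfig, ScrapflyClient

client = ScrapflyClient(key="YOUR-SCRAPFLY-KEY")
result = client.scrape(
    ScrapeConfig(
        url="https://portal.example.com/jobs?page=1",  # placeholder portal URL
        asp=True,  # Anti Scraping Protection; this is what makes requests slow
    )
)
html = result.content  # scraped page HTML, ready for parsing
```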
- They don't show all jobs unless you're logged in.
- For now, the browser cookies (captured after logging in) are used to make requests and scrape (see the sketch below).
- These cookies seem to be long-lasting (they haven't needed to be changed even once since this was implemented).
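A minimal sketch of that cookie-based approach; the cookie name and portal URL are placeholders, and the value would be copied from the browser's dev tools after logging in:

```python
# Sketch: reuse logged-in browser cookies so the portal returns all jobs.
# Cookie name, value, and URL are placeholders.
import requests

session = requests.Session()
session.cookies.update({"sessionid": "<value copied from browser dev tools>"})

resp = session.get("https://portal.example.com/jobs", timeout=30)
resp.raise_for_status()
print(resp.text[:200])  # logged-in page HTML, ready for scraping
```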
- Add filtering by location; nowadays "remote" doesn't actually mean remote, since some job descriptions say remote India, remote USA, etc. (a hypothetical sketch of the idea follows).
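A hypothetical sketch of that filter: treat "remote &lt;region&gt;" as region-restricted, and only "remote" on its own (or with a global qualifier) as truly remote. The heuristic is naive and purely illustrative:

```python
# Hypothetical sketch of the location-filter idea: "remote India" or
# "remote USA" should not count as globally remote. Naive heuristic.
import re

GLOBAL_QUALIFIERS = {"worldwide", "anywhere", "global"}

def is_globally_remote(description: str) -> bool:
    """True only when 'remote' isn't qualified by a specific region."""
    match = re.search(r"\bremote\b[\s,:-]*([A-Za-z ]+)?", description, re.IGNORECASE)
    if not match:
        return False  # not a remote job at all
    qualifier = (match.group(1) or "").strip().lower()
    return qualifier == "" or qualifier in GLOBAL_QUALIFIERS

print(is_globally_remote("Remote, worldwide"))  # True
print(is_globally_remote("Remote India"))       # False
```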