Skip to content

Run scraper on local machine to gather regulations.gov comments for the NILC #57

@dotj

Description

@dotj

Continuation of #48:

We have not yet been able to get a VM up to run the scraper, so we need your help running the scraper locally in order to gather an initial dataset that the NILC can look at.

The documentation for the scraper can be found here: https://github.com/Data4Democracy/immigration-connect/tree/master/public-charge/scraper

Ping me (@dotj) or @alejandrox1 here or post in the #immigration-connect slack page if you need help,

We've seen each page (50 comments) take about 4 minutes to scrape, and there are currently almost 10k comments, so it will take about 13 hours total. Of course, this is dependent on your internet speed and various other factors.

Tasks

  • Set up the scraper locally
  • Let the scraper run and collect all the comments (~13 hours)

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions