Automated org research tooling #100

@rivernews

Develop a microservice that can research the following about an organization:

  • Logo icon
  • Size
  • Founding year
  • Industry
  • Description
  • HQ location
  • Engineering team size and demographics (name, title)
  • Numeric ratings, including per-category breakdowns
  • (Don't deal with qualitative review text at this point)
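
As a starting point, the fields above could be modeled as a single record that the scraper fills in and serializes. This is only a sketch: the type names, the JSON keys, and the extra `Name` field (needed to identify the org) are assumptions, not an agreed schema.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// Engineer is one member of the engineering team (name and title only).
type Engineer struct {
	Name  string `json:"name"`
	Title string `json:"title"`
}

// OrgProfile is a hypothetical record for one researched organization.
// All field names and JSON keys are illustrative assumptions.
type OrgProfile struct {
	Name            string             `json:"name"`
	LogoURL         string             `json:"logo_url,omitempty"`
	Size            string             `json:"size,omitempty"`
	FoundedYear     int                `json:"founded_year,omitempty"`
	Industry        string             `json:"industry,omitempty"`
	Description     string             `json:"description,omitempty"`
	HQLocation      string             `json:"hq_location,omitempty"`
	EngineeringTeam []Engineer         `json:"engineering_team,omitempty"`
	OverallRating   float64            `json:"overall_rating,omitempty"`
	RatingBreakdown map[string]float64 `json:"rating_breakdown,omitempty"`
}

// encodeProfile serializes a profile to JSON; empty fields are omitted
// so a partially researched org still produces a valid payload.
func encodeProfile(p OrgProfile) (string, error) {
	b, err := json.Marshal(p)
	if err != nil {
		return "", err
	}
	return string(b), nil
}

func main() {
	p := OrgProfile{
		Name:            "Acme",
		FoundedYear:     1999,
		Industry:        "Software",
		EngineeringTeam: []Engineer{{Name: "Jane Doe", Title: "Staff Engineer"}},
		RatingBreakdown: map[string]float64{"culture": 4.1},
	}
	out, err := encodeProfile(p)
	if err != nil {
		fmt.Println("encode failed:", err)
		return
	}
	fmt.Println(out)
}
```

Using `omitempty` means a field the scraper could not find is simply left out of the payload. The trade-off is that a genuine zero value (for example a rating of 0) is also dropped, so pointer fields may be a better fit if that distinction matters.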

Then a cron job can POST the data to appl-tracky, and the UI can display it.
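
The POST step might look roughly like the sketch below, using only the Go standard library. The endpoint URL and the payload shape are placeholders; appl-tracky's actual API route and any authentication it requires are not specified in this issue.

```go
package main

import (
	"bytes"
	"fmt"
	"net/http"
	"time"
)

// newProfileRequest builds a JSON POST request for one org payload.
func newProfileRequest(endpoint string, payload []byte) (*http.Request, error) {
	req, err := http.NewRequest(http.MethodPost, endpoint, bytes.NewReader(payload))
	if err != nil {
		return nil, err
	}
	req.Header.Set("Content-Type", "application/json")
	return req, nil
}

// postProfile sends the payload and treats any non-2xx status as an error.
func postProfile(client *http.Client, endpoint string, payload []byte) error {
	req, err := newProfileRequest(endpoint, payload)
	if err != nil {
		return err
	}
	resp, err := client.Do(req)
	if err != nil {
		return err
	}
	defer resp.Body.Close()
	if resp.StatusCode < 200 || resp.StatusCode >= 300 {
		return fmt.Errorf("unexpected status: %s", resp.Status)
	}
	return nil
}

func main() {
	// A timeout keeps a scheduled run from hanging on a dead connection.
	client := &http.Client{Timeout: 10 * time.Second}

	// Hypothetical endpoint and payload, for illustration only.
	endpoint := "http://localhost:8000/api/orgs/"
	payload := []byte(`{"name":"Acme","founded_year":1999}`)

	if err := postProfile(client, endpoint, payload); err != nil {
		fmt.Println("post failed:", err)
		return
	}
	fmt.Println("posted")
}
```

A binary like this can be invoked once per run by whatever scheduler is chosen (system cron, a Kubernetes CronJob, etc.), so the program itself needs no scheduling logic.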

Scraping in Go

We need a Go scraper that can handle JavaScript-rendered pages; this blog sums up some great scrapers.

  • Using chromedp.
    • Testing/running locally: we can't mount the source code as a volume, but we can copy the compiled executable into the image and run it there.
  • On AWS, we can at least use a Docker layer, as with the Selenium image.
    • Ideally, no Docker would be needed.
    • But browser-based solutions all seem to need a container, perhaps because the browser binary is platform-specific.
