Skip to content

Ironhack-Data-0621-Remote/lab-web-scraping-multipages

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

logo_ironhack_blue 7

Lab | Web Scraping Multiple Pages

Business goal:

  • Check the case_study_gnod.md file.

  • Make sure you've understood the big picture of your project:

    • the goal of the company (Gnod),
    • their current product (Gnoosic),
    • their strategy, and
    • how your project fits into this context.

    Re-read the business case and the e-mail from the CTO, take a look at the flowchart and create an initial Trello (or git-hub project) board with the tasks you think you'll have to accomplish.

Instructions

Prioritize the MVP

In the first notebook, you have to scrape data about "hot songs". It's critical to be on track with that part, as it was part of the request from the CTO.

Expand the project

If you're done, you can try to expand the project on your own. Here are a few suggestions:

  • Find other lists of hot songs on the internet and scrape them too: having a bigger pool of songs will be awesome!
  • Apply the same logic to other "groups" of songs: the best songs from a decade or from a country / culture / language / genre.
  • Wikipedia maintains a large collection of lists of songs: https://en.wikipedia.org/wiki/Lists_of_songs

Practice web scraping

Go to Further_questions file to answer some more questions.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published