This is a Python-based web scraper that collects product listings from a popular musical instrument retailer's website. It allows users to interactively select a product category and the number of pages to scrape, then outputs detailed product information to a CSV file.
- Category selection (e.g., Guitars, Drums, Keyboards, etc.)
- User-defined number of pages to scrape
- Extracts the following fields:
- Product Name
- Product Price
- Store Location
- Condition
- Product Image URL
- Product Link URL
- Saves results as a
.csv
file in theoutput
folder
Install required libraries with:
pip install -r requirements.txt
requirements.txt includes:
- requests
- beautifulsoup4
- lxml
- pandas
- fake-useragent
- Clone or download this repository.
- Choose a category by entering the corresponding letter.
- Enter how many pages you want to scrape.
- After scraping, the data will be saved in the output/ directory as a timestamped .csv file.
A sample row in the output CSV:
Title | Price | Store | Condition | Image |
---|---|---|---|---|
Used Fender Strat | $599 | Portland, OR | Great | https://gc.com/image/guitar.jpg |
Be respectful with your scraping. The script includes randomized delays to avoid hammering the site with requests. This scraper is intended for educational and personal use.