
This repository contains a Python-based scraper for extracting Amazon Best Sellers product data. The scraper uses the Crawlbase Crawling API with the Amazon Best Sellers scraper, ensuring seamless data extraction while bypassing Amazon’s anti-bot protections.
➡ Read the full blog here to learn more.
The amazon_best_sellers_scraper.py
extracts top-selling product details from the Amazon Best Sellers page, including:
- Product Title
- Price
- Customer Review Rating
- Number of Reviews
- Review Page Link
- ASIN
- Product Image URL
- Product Page Link
- Amazon Prime Availability
- Product Rank
- Categories (Selected & Other Available Categories)
- Pagination Details (Current Page & Next Page Number)
The scraper automatically retrieves structured data in JSON format, making it easy to process and analyze.
Ensure Python is installed on your system. Check the version using:
python --version
Install the required dependency:
pip install crawlbase
- Sign up on Crawlbase to get an API token.
- This token is required to access the Crawling API for bypassing Amazon’s bot protection.
Replace "CRAWLBASE_API_TOKEN
" in the script with your Crawlbase Crawling API Token.
python amazon_best_sellers_scraper.py
The extracted Amazon Best Sellers data will be saved in a JSON file named amazon_best_sellers.json
.