Skip to content

ScraperHub/amazon-best-sellers-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 

Repository files navigation

crawling-api-cta

amazon-best-sellers-scraper

Description

This repository contains a Python-based scraper for extracting Amazon Best Sellers product data. The scraper uses the Crawlbase Crawling API with the Amazon Best Sellers scraper, ensuring seamless data extraction while bypassing Amazon’s anti-bot protections.

➡ Read the full blog here to learn more.

Scraper Overview

Amazon Best Sellers Scraper

The amazon_best_sellers_scraper.py extracts top-selling product details from the Amazon Best Sellers page, including:

  • Product Title
  • Price
  • Customer Review Rating
  • Number of Reviews
  • Review Page Link
  • ASIN
  • Product Image URL
  • Product Page Link
  • Amazon Prime Availability
  • Product Rank
  • Categories (Selected & Other Available Categories)
  • Pagination Details (Current Page & Next Page Number)

The scraper automatically retrieves structured data in JSON format, making it easy to process and analyze.

Environment Setup

Ensure Python is installed on your system. Check the version using:

python --version

Install the required dependency:

pip install crawlbase

Running the Scraper

1. Get Your Crawlbase Access Token

  • Sign up on Crawlbase to get an API token.
  • This token is required to access the Crawling API for bypassing Amazon’s bot protection.

2. Update the Scraper with Your Token

Replace "CRAWLBASE_API_TOKEN" in the script with your Crawlbase Crawling API Token.

3. Run the Scraper

python amazon_best_sellers_scraper.py

The extracted Amazon Best Sellers data will be saved in a JSON file named amazon_best_sellers.json.

About

Amazon Best Sellers Scraper in Python. In the script, we are using Crawlbase "amazon-best-sellers" scraper.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages