Scrapy Project

This repository contains two Jupyter notebooks and one Python script for web scraping with the requests and BeautifulSoup libraries. They demonstrate how to scrape information from websites such as IMDb and Quotes to Scrape.

Files

1. `main.ipynb` - IMDb Top 25 Movies Scraper

This Jupyter notebook scrapes the first 25 movies from IMDb's Top 250 chart and extracts each movie's title and rating.

How it works:

It sends a GET request to the IMDb Top 250 page.
It parses the HTML content using BeautifulSoup.
It extracts the movie titles and ratings, then prints them out in a numbered list.
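
The steps above could be sketched roughly as follows. This is a minimal sketch rather than the notebook's actual code: the URL, the `User-Agent` header, the CSS selectors, the made-up `SAMPLE_HTML`, and the helper names `fetch_html` and `parse_top_movies` are all assumptions for illustration. IMDb's markup changes over time, so the selectors will likely need adjusting.

```python
import requests
from bs4 import BeautifulSoup

# Assumed URL; the README only says "the IMDb Top 250 page".
IMDB_TOP_URL = "https://www.imdb.com/chart/top/"

# Made-up sample in an assumed table layout, so the parser can be tried offline.
SAMPLE_HTML = """
<table>
  <tr><td class="titleColumn"><a href="#">Example Movie One</a></td>
      <td class="imdbRating"><strong>9.1</strong></td></tr>
  <tr><td class="titleColumn"><a href="#">Example Movie Two</a></td>
      <td class="imdbRating"><strong>8.7</strong></td></tr>
  <tr><td class="titleColumn"><a href="#">Example Movie Three</a></td>
      <td class="imdbRating"><strong>8.5</strong></td></tr>
</table>
"""


def fetch_html(url):
    """Download a page and return its HTML, raising on HTTP errors."""
    # A browser-like User-Agent is an assumption; some sites reject the default one.
    headers = {"User-Agent": "Mozilla/5.0"}
    response = requests.get(url, headers=headers, timeout=10)
    response.raise_for_status()
    return response.text


def parse_top_movies(html, limit=25):
    """Return up to `limit` (title, rating) pairs in page order."""
    soup = BeautifulSoup(html, "html.parser")
    movies = []
    for row in soup.find_all("tr"):
        # Hypothetical selectors; adjust them to the markup IMDb currently serves.
        title = row.select_one("td.titleColumn a")
        rating = row.select_one("td.imdbRating strong")
        if title is None or rating is None:
            continue  # skip header rows or rows with missing data
        movies.append((title.get_text(strip=True), rating.get_text(strip=True)))
        if len(movies) == limit:
            break
    return movies


# Offline demonstration on the sample data, printed as a numbered list.
for rank, (title, rating) in enumerate(parse_top_movies(SAMPLE_HTML, limit=2), start=1):
    print(f"{rank}. {title} - {rating}")
```

For a live run, `parse_top_movies(fetch_html(IMDB_TOP_URL))` would replace the sample input. Keeping the download and the parsing in separate functions makes the parser easy to check without network access.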

How to run:

Install the required dependencies:
```
pip install requests beautifulsoup4
```
Run the notebook in Jupyter Notebook or Jupyter Lab to view the top 25 movies and their ratings.

2. `quotes.ipynb` - Quotes Scraper

This Jupyter notebook scrapes quotes from the website https://quotes.toscrape.com. It extracts the text of each quote and prints it.

How it works:

It sends a GET request to the Quotes to Scrape website.
It parses the HTML content using BeautifulSoup.
It finds every quote inside a `<span class="text">` element and prints each one with a number.
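
A rough sketch of these steps is shown below. The helper names `extract_quotes` and `scrape_quotes` and the made-up `SAMPLE_HTML` are assumptions for illustration, not the notebook's actual code. Only the URL and the `span` with class `text` come from the description above.

```python
import requests
from bs4 import BeautifulSoup

QUOTES_URL = "https://quotes.toscrape.com"

# Made-up sample shaped like the markup described above, for offline use.
SAMPLE_HTML = """
<div class="quote"><span class="text">"First sample quote."</span>
  <small class="author">Author A</small></div>
<div class="quote"><span class="text">"Second sample quote."</span>
  <small class="author">Author B</small></div>
"""


def extract_quotes(html):
    """Return the text of every <span class="text"> element in the HTML."""
    soup = BeautifulSoup(html, "html.parser")
    return [span.get_text(strip=True) for span in soup.find_all("span", class_="text")]


def scrape_quotes(url=QUOTES_URL):
    """Fetch the page (hypothetical helper) and return its quotes."""
    response = requests.get(url, timeout=10)
    response.raise_for_status()  # stop early on 4xx/5xx responses
    return extract_quotes(response.text)


# Offline demonstration: print the sample quotes with numbering.
for number, quote in enumerate(extract_quotes(SAMPLE_HTML), start=1):
    print(f"{number}. {quote}")
```

Calling `scrape_quotes()` instead would run the same extraction against the live site.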

How to run:

Install the required dependencies:
```
pip install requests beautifulsoup4
```
Run the notebook in Jupyter Notebook or Jupyter Lab to see the scraped quotes.

3. `scrapy.py` - Basic Web Scraping Example

This simple script demonstrates how to scrape paragraphs (`<p>` tags) from a web page. It is meant as a basic introduction to web scraping.

How it works:

It sends a GET request to a sample URL (https://example.com).
It parses the HTML content and extracts all paragraphs.
It prints the text of each paragraph on the page.
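
As a minimal sketch of the flow above (not the contents of `scrapy.py` itself), the paragraph extraction might look like this. The helper names `extract_paragraphs` and `scrape_paragraphs` and the made-up `SAMPLE_HTML` are assumptions for illustration.

```python
import requests
from bs4 import BeautifulSoup

URL = "https://example.com"

# Made-up sample page, so the extraction can be tried without network access.
SAMPLE_HTML = (
    "<html><body><h1>Title</h1>"
    "<p>First paragraph.</p>"
    "<p>Second <a href='#'>linked</a> paragraph.</p>"
    "</body></html>"
)


def extract_paragraphs(html):
    """Return the text of every <p> element, including text in nested tags."""
    soup = BeautifulSoup(html, "html.parser")
    return [p.get_text().strip() for p in soup.find_all("p")]


def scrape_paragraphs(url=URL):
    """Fetch the page (hypothetical helper) and return its paragraph texts."""
    response = requests.get(url, timeout=10)
    response.raise_for_status()  # stop early on 4xx/5xx responses
    return extract_paragraphs(response.text)


# Offline demonstration: print each paragraph of the sample page.
for paragraph in extract_paragraphs(SAMPLE_HTML):
    print(paragraph)
```

Swapping `extract_paragraphs(SAMPLE_HTML)` for `scrape_paragraphs()` would print the paragraphs of the live page.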

How to run:

Install the required dependencies:
```
pip install requests beautifulsoup4
```
Run the script using Python:
```
python scrapy.py
```

Setup

Prerequisites:

Python 3.x
Jupyter Notebook or Jupyter Lab (for .ipynb files)
requests and beautifulsoup4 libraries

Installing Dependencies:

Install the necessary Python packages for web scraping:

pip install requests beautifulsoup4
