Web Scraping using BeautifulSoup in Computational Gastronomy

Dealing with the Data that is present on recipeall website and plotting some important graphs

Web Scraping (BeautifulSoup)

This projects deals with the scraped data from food websites and use that data to plot relevant graphs. Some important distributions includes frequency-rank distributions, recipe size distributions

Why I Use BeautifulSoup or Jupyter Notebook

I use BeautifulSoup for web scraping because it provides a powerful yet simple interface to navigate and extract data from HTML and XML documents. Its flexibility allows to efficiently parse complex web pages, extract relevant information like text, links, and images, and convert them into structured formats. Jupyter Notebook complements this by offering an interactive environment where I can write and execute code in real time, visualize data instantly, and document my thought process. This combination makes it easier to experiment, debug, and refine scraping scripts quickly, while also providing a platform for sharing and presenting results effectively.

Main Components

.ipynb files: These jupyter files contained the code for problem statement.
.csv and .txt file: The output of the data extractions are stored in this files.

Clone the Repository

git clone https://github.com/akshatrajsaxena/CGAS_Assignment_1.git

Install Dependencies

Install the neccessary Libraries that is mentioned in the file Q1.ipynb file. The libraries are mentioned below:

pip install requests 
pip install html5lib 
pip install bs4 
pip install pandas 
pip install lxml 
pip install spacy 
pip install scikit-learn spacy 
pip install transformers 
pip install torch 
pip install matplotlib

If you find any issues or have suggestions for improvements, feel free to open an issue or submit a pull request. Contributions are welcome!

LICENSE

MIT License

Contact

If you have any questions or would like to get in touch, you can reach me at Akshat Raj Saxena or Aditya Sharma

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
Q1		Q1
Q2		Q2
Q3		Q3
CGAS-BIO544-Assignment_1_Problems.pdf		CGAS-BIO544-Assignment_1_Problems.pdf
CGAS_Assignment_1_Q4_Q8.pdf		CGAS_Assignment_1_Q4_Q8.pdf
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Web Scraping using BeautifulSoup in Computational Gastronomy

Web Scraping (BeautifulSoup)

Why I Use BeautifulSoup or Jupyter Notebook

Clone the Repository

Install Dependencies

LICENSE

Contact

About

Uh oh!

Releases

Packages

Languages

License

akshatrajsaxena/CGAS_Assignment_1

Folders and files

Latest commit

History

Repository files navigation

Web Scraping using BeautifulSoup in Computational Gastronomy

Web Scraping (BeautifulSoup)

Why I Use BeautifulSoup or Jupyter Notebook

Clone the Repository

Install Dependencies

LICENSE

Contact

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages