Project Information

Team Members: Simrun Mutha & Melody Chiu

The goal of the CoffeeFinder project is to find what makes each coffee brand unique in terms of flavor. To do this, the project is divided into four major portions of code:

Data Collection: This involves scraping all the reviews for an Amazon product through its All Reviews page. The code for this portion can be found in scraping_reviews.py.
Data Processing: This involves processing the data scraped from the Amazon site. The reviews are sorted through to find adjectives that relate to coffee flavor profiles. The code for this portion can be found in processing_data.py.
Data Visualization: This involves illustrating the words output from processing and their frequency. The code for this portion can be found in visualizing_data.py.
Loading: This involves loading the information about the coffee brands and frequent nouns in Amazon reviews that are unrelated to coffee flavor. The code for this portion can be found in loading.py.
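
As a rough illustration of how the loading and processing steps could fit together, the sketch below counts candidate words across a few reviews while skipping words from a disregard list. It is not the code in processing_data.py or loading.py: the function names, the sample reviews, and the CANDIDATE_WORDS set are all hypothetical, and a fixed word set is swapped in for nltk-based tagging so the example runs with the standard library alone.

```python
import re
from collections import Counter

# Hypothetical stand-in for the words a part-of-speech tagger would pick out.
CANDIDATE_WORDS = {"bold", "smooth", "bitter", "nutty", "rich", "price", "package"}


def load_word_list(lines):
    """Return a set of lowercase words, one per non-empty line.

    `lines` can be an open text file (such as a disregard list) or any
    iterable of strings.
    """
    return {line.strip().lower() for line in lines if line.strip()}


def flavor_word_counts(reviews, disregard=frozenset()):
    """Count candidate words across reviews, ignoring words in `disregard`."""
    counts = Counter()
    for review in reviews:
        for word in re.findall(r"[a-z]+", review.lower()):
            if word in CANDIDATE_WORDS and word not in disregard:
                counts[word] += 1
    return counts


# Made-up sample data, for illustration only.
sample_reviews = [
    "Bold and smooth, with a nutty finish.",
    "Great price, and the coffee is smooth and rich.",
    "Rich flavor, but the package arrived damaged.",
]
disregard_words = load_word_list(["Price\n", "package\n", "coffee\n", "\n"])
counts = flavor_word_counts(sample_reviews, disregard_words)
print(counts.most_common())
```

A Counter like this could then feed the visualization step; the wordcloud package provides WordCloud.generate_from_frequencies for frequency dictionaries of this shape.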

Installing Libraries

This project relies on several libraries, two of which are nltk and WordCloud. To install nltk, we downloaded the files from http://pypi.python.org/pypi/nltk; the library is then loaded with the line import nltk at the top of the Python file. To install WordCloud, run the command pip install wordcloud in the bash terminal, then add the line from wordcloud import WordCloud, STOPWORDS at the top of any Python file that uses it.
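
Before running the scripts, it can help to confirm that both libraries are importable. The following is a minimal sketch using only the standard library; the helper name missing_packages is hypothetical and is not part of the repository.

```python
import importlib.util


def missing_packages(names):
    """Return the names from `names` that cannot be found for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# The two third-party libraries named above.
missing = missing_packages(["nltk", "wordcloud"])
if missing:
    print("Not installed:", ", ".join(missing))
else:
    print("nltk and wordcloud are both available.")
```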

Folders and files

.ipynb_checkpoints
__pycache__
CoffeeFinder Computational Essay.ipynb
Community
Folgers
Koffee Kult
Maxwells House
Peets
README.md
Ravens
Seattles Best
Yuban
brands.txt
loading.py
nouns-to-disregard.txt
processing_data.py
scraping_reviews.py
test.py
test_processing_data.py
visualizing_data.py
