Classifying and Visualizing trends from Food-related Twitter tweets using Machine Learning and Spark
Data pre-processing, classification, and visualizations are included
Some technologies used:
- Python/R
- Apache Spark
Machine Learning Techniques:
- Natural Lanaguage Processing (NLTK)
- Sentiment Analysis
- 10-fold Cross Validation
- Support Vector Machines (SVM)
- Naive Bayes
- Linear Discriminant Analysis (LDA) for Data Mining
Collaborators:
- Andy Wu
- Aaron Leung
- Tanner Litwin