This repository contains the LaTeX code for the final report of an applied machine learning project that performs sentiment analysis on the Sentiment140 dataset.
The project applies data cleaning techniques and classification models, including Logistic Regression and Support Vector Machines (SVM), to analyze public sentiment in Twitter data.
The dataset consists of 1.6 million labeled tweets, and the report includes an in-depth exploration of the dataset, the preprocessing steps taken, the models applied, and the evaluation results.
The code repository is available at https://github.com/adhillon192/SimpleMLProject
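
To make the data cleaning step mentioned above more concrete, here is a minimal sketch of the kind of tweet normalization often applied before training a text classifier. It is an illustration only, not the project's actual preprocessing: the function name `clean_tweet` and the specific rules (lowercasing, removing URLs and @mentions, keeping hashtag words, dropping non-letters) are assumptions; the report describes the steps the project really used.

```python
import html
import re

# Hypothetical cleaning rules, chosen for illustration; the report's
# actual preprocessing may differ.
URL_RE = re.compile(r"https?://\S+|www\.\S+")
MENTION_RE = re.compile(r"@\w+")
NON_ALPHA_RE = re.compile(r"[^a-z\s]")
WHITESPACE_RE = re.compile(r"\s+")


def clean_tweet(text: str) -> str:
    """Normalize a raw tweet into lowercase, space-separated words."""
    text = html.unescape(text)         # decode entities such as &amp;
    text = text.lower()
    text = URL_RE.sub(" ", text)       # drop links
    text = MENTION_RE.sub(" ", text)   # drop @username mentions
    text = text.replace("#", " ")      # keep the hashtag word, drop the symbol
    text = text.replace("'", "")       # "don't" becomes "dont"
    text = NON_ALPHA_RE.sub(" ", text) # drop digits and punctuation
    return WHITESPACE_RE.sub(" ", text).strip()


if __name__ == "__main__":
    print(clean_tweet("@user I LOVE this!! http://t.co/abc #happy &amp; more"))
```

Text cleaned this way would typically be passed to a vectorizer (for example, bag-of-words or TF-IDF features) before fitting a model such as Logistic Regression or an SVM.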