This project has been my first experience of training a model. Using Support Vector Machines (SVM) to create a prediction model for sentiment analysis on a dataset of Amazon reviews. It has simple data preprocessing, feature extraction, model training, and evaluation.
This project uses SVM to classify Amazon product reviews into positive or negative sentiments. The workflow includes:
- Loading and preprocessing the dataset
- Visualizing the distribution of sentiments
- Vectorizing the text data using bigrams
- Identifying the most common words
- Training an SVM model
- Evaluating the model's accuracy
Necessary packages:
pandas matplotlib nltk scikit-learn