SentimentClassification

About The Project

This is a university project developed for the "Data Science" course at KTH. We aimed to perform sentiment classification of social media comments in Swedish using semi-supervised learning. The final report can be seen here.

Built With

Contributors

  • Carlos Lago
  • Xuecong Liu
  • Zhenlin Zhou
  • Eliott Remmer

Getting Started

To get a local copy up and running, follow these steps.

Prerequisites

  • Python
sudo apt install python3 python3-dev

Installation

  1. Clone the repo
git clone https://github.com/carloslago/SentimentClassification.git
  2. Install Python packages
sudo pip install -r requirements.txt

Usage

The project fine-tunes a BERT model on social media comments using both supervised learning and semi-supervised learning with unlabelled data (GAN-BERT).

Data should have the following format. For unlabelled data, the sentiment label doesn't need to be present.

{"sentiment_label": "-1/1", "message": "This is a comment in swedish"}
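As a rough illustration of the format above, the following sketch splits records into labelled and unlabelled examples. It assumes one JSON object per line and labels stored as the strings "-1" or "1"; neither is confirmed by the repository, and `parse_records` is a hypothetical helper, not part of the project scripts.

```python
import json
from typing import Iterable, List, Tuple


def parse_records(lines: Iterable[str]) -> Tuple[List[Tuple[str, int]], List[str]]:
    """Split JSON records into (message, label) pairs and unlabelled messages.

    Assumption: each line holds one JSON object in the format shown above.
    """
    labelled: List[Tuple[str, int]] = []
    unlabelled: List[str] = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        message = record["message"]
        label = record.get("sentiment_label")
        if label is None:
            # No label present: treat the comment as unlabelled data.
            unlabelled.append(message)
        else:
            value = int(label)
            if value not in (-1, 1):
                raise ValueError(f"unexpected sentiment_label: {label!r}")
            labelled.append((message, value))
    return labelled, unlabelled


# Hypothetical sample records, for illustration only.
sample = [
    '{"sentiment_label": "1", "message": "Vilken fin dag!"}',
    '{"sentiment_label": "-1", "message": "Det här var dåligt."}',
    '{"message": "Jag åker till Stockholm imorgon."}',
]
labelled, unlabelled = parse_records(sample)
print(len(labelled), "labelled,", len(unlabelled), "unlabelled")
```

Keeping the two groups separate makes it easy to feed only the labelled pairs to supervised fine-tuning while using both groups for GAN-BERT.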

Fine-tuning Swedish BERT:

python bert_fine_tuning.py

Training GAN-BERT with unlabelled data:

python gan_bert.py

Contributing

Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

Contact

Feel free to send us any bug reports, ask any questions or request any features via email. Just keep in mind we did this as a university project.
