Skip to content
This repository was archived by the owner on Apr 11, 2025. It is now read-only.

Final project for 50.040 Natural Language Processing course taken in Fall 2024, built with Ansh Oswal and Elvern Neylman Tanny.

License

Notifications You must be signed in to change notification settings

T2LIPthedeveloper/50.040-NLP-Final-Project

Repository files navigation

50.040 Natural Language Processing - Final Project - Hybrid CNN-RNN + Attention Sentiment Analysis

This project implements a sentiment analysis model using a Hybrid CNN-RNN architecture with an Attention Mechanism for improved performance. The model is trained on the IMDb movie reviews dataset and leverages pre-trained GloVe embeddings for semantic understanding.

Repository Location

The dataset repository for this project is hosted on GitHub and can be accessed via the following link:

GitHub Repository

This repository contains all the code, dataset files, and results necessary to replicate the project and analyze the performance of various models.


Setup Instructions

Prerequisites

  • Python 3.8+
  • Jupyter Notebook
  • Required Python Libraries (Install via requirements.txt):
    pip install -r requirements.txt

Steps to Run

  1. Clone the repository:

    git clone https://github.com/T2LIPthedeveloper/50.040-NLP-Final-Project
    cd 50.040-NLP-Final-Project
  2. Install required libraries:

    pip install -r requirements.txt
  3. Open the Python environment and run the following:

    python sentencesenseis_script.py

Models and Results

The results for the sentiment analysis task using various models are summarized below:

Hybrid-CNN-RNN with Attention

  • Accuracy: 0.9472
  • Precision: 0.9464
  • Recall: 0.9480
  • F1 Score: 0.9472

TextCNN

  • Accuracy: 0.8476
  • Precision: 0.8132
  • Recall: 0.9026
  • F1 Score: 0.8556

BiRNN

  • Accuracy: 0.8320
  • Precision: 0.9116
  • Recall: 0.7352
  • F1 Score: 0.8140

Performance Metrics

For detailed performance metrics, refer to the result files:


Contributors

This project was developed as part of the 50.040 Natural Language Processing course at SUTD. The group members are:

  • Ansh Oswal (1006265)
  • Atul Parida (1006184)
  • Elvern Neylman Tanny (1006203)

Acknowledgements

We would like to thank the following:

  • Stanford AI Group for providing the IMDb dataset.
  • SUTD and Professor XX for their guidance and resources for the 50.040 NLP course.

About

Final project for 50.040 Natural Language Processing course taken in Fall 2024, built with Ansh Oswal and Elvern Neylman Tanny.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •