This repository contains the complete files for my final year project, which focuses on sentiment analysis using machine learning, deep learning, and transformer-based models. The goal is to analyze mental health-related social media posts and classify them based on emotional and psychological states.
. ├── code/ # All model training and prediction notebooks └── dataset/ # Datasets used in this project (from Kaggle)
This directory contains all experiment-related files, including model training and prediction generation notebooks. It is organized into three subfolders: Dataset1
, Dataset2
, and Dataset3
.
- Format:
Model Name (Model Training).ipynb
- Includes training code for traditional ML, CNN-BiLSTM, and BERT models.
- Notably,
D1.ML.ipynb
trains four classical machine learning models.
- Format:
Model Name (Generate Prediction CSV).ipynb
- These notebooks load trained models, perform inference, and export results to
.csv
.
Each output CSV contains:
- Input text
- Ground truth label
- Predicted label
Models evaluated:
Extra Trees Classifier
, CNN-BiLSTM
, and BERT
This folder includes all datasets used for model training and evaluation. All datasets were sourced from Kaggle and are provided in .csv
format for convenience.
-
Sentiment Analysis for Mental Health
📎 https://www.kaggle.com/datasets/suchintikasarkar/sentiment-analysis-for-mental-health -
Mental Health [Sentiment Analysis] Data
📎 https://www.kaggle.com/datasets/sujaykapadnis/mental-health-insights-data -
Mental Health Support Feature Analysis
📎 https://www.kaggle.com/datasets/thedevastator/mental-health-support-feature-analysis
⚠️ Note: Only a subset of this dataset was used in the project.
- Python
- Jupyter Notebook
- Scikit-learn
- Keras (TensorFlow)
- Hugging Face Transformers
- Pandas, NumPy, Matplotlib
- Text Preprocessing
- Model Training (Traditional ML,CNN,BiLSTM, CNN-BiLSTM, BERT, BERT-BiLSTM)
- Prediction & Evaluation
- CSV Output Generation
This project is for academic and research purposes only. Datasets belong to their respective Kaggle contributors.