Twitter Suicidal Ideation Detection using NLP and ML/DL Models

This project aims to develop a system that can detect suicidal ideation on Twitter using Natural Language Processing (NLP) and Machine Learning (ML) models, including Logistic Regression, Support Vector Machines (SVM), Random Forest (RF), Multinomial Naive Bayes (MNB), Ensemble Learning, AdaBoost, Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), Convolutional Neural Networks (CNN), and Bidirectional Encoder Representations from Transformers (BERT). This project aims to identify individuals who may be at risk of suicide and contribute to suicide prevention efforts.
It also includes a Flask web application for real-time predictions.

📦 Dataset:

I've used the Twitter Suicide Dataset, which contains Tweets from individuals who have experienced suicidal thoughts and ideations.
The dataset includes more than 1787 Tweets and comments, and it has been labeled as either indicating suicidal ideation or not.

⚙️ Technical Approach

Data Preprocessing:

As part of the data preprocessing phase, First I converted Tweets to lower case then cleaned and filtered the text by removing URLs, stop words, and special characters.
Next, I tokenized the text and performed stemming and lemmatization to reduce the words to their base form.
This step helped in reducing the dimensionality of the data and improved the efficiency of the models.

Feature Extraction:

For feature extraction, I used the Term Frequency-Inverse Document Frequency (TF-IDF) technique.
Additionally, I also experimented with custom embeddings to capture semantic meaning and reduce the dimensionality of the data.
These techniques helped in representing the text in a numerical format that could be used by the models to make predictions.

Machine Learning Models:

During the experimentation phase, I trained several ML models including Logistic Regression, Support Vector Machines (SVM), Random Forest (RF), Multinomial Naive Bayes (MNB), Ensemble Learning, and AdaBoost.
These models were used to make predictions on the test set and identify individuals who may be at risk of suicide.

Deep Learning Models:

In addition to the ML models, I also experimented with several DL models such as LSTM, GRU, CNN, and BERT.
To improve the performance of the BERT model, I fine-tuned it using the ktrain library.
By fine-tuning the pre-trained BERT model, I was able to capture the context and nuances of the text better, resulting in higher accuracy and F1 scores.
These DL models were used alongside the ML models to identify individuals who may be at risk of suicide.

📈 Results:

After evaluating the performance of all the models, the best-performing ML model was the ensemble learning model, which achieved an F1 score of 88.67% on the test set.
On the other hand, best-performing DL models were LSTM-2 Layer and BERT, which achieved 94.18% and 96.00% accuracy, respectively.
These results demonstrate the effectiveness of using NLP and ML/DL models to detect suicidal ideation on Twitter and highlight the potential for these models to contribute to suicide prevention efforts.

🛠️ Web Application Interface

🌟 Conclusion:

My study demonstrates the feasibility of using NLP and ML/DL models to detect suicidal ideation on Twitter. The models I developed achieved high accuracy and F1 scores, indicating their potential usefulness in identifying individuals who may be at risk of suicide. These findings suggest that NLP and ML/DL models have the potential to contribute to suicide prevention efforts by identifying individuals who may need help and support.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Dataset		Dataset
Web Application		Web Application
ui		ui
README.md		README.md
Twitter_Suicidal_Ideation_Detection.ipynb		Twitter_Suicidal_Ideation_Detection.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Twitter Suicidal Ideation Detection using NLP and ML/DL Models

📦 Dataset:

⚙️ Technical Approach

Data Preprocessing:

Feature Extraction:

Machine Learning Models:

Deep Learning Models:

📈 Results:

🛠️ Web Application Interface

🌟 Conclusion:

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Snigdho8869/twitter-suicidal-ideation-detection

Folders and files

Latest commit

History

Repository files navigation

Twitter Suicidal Ideation Detection using NLP and ML/DL Models

📦 Dataset:

⚙️ Technical Approach

Data Preprocessing:

Feature Extraction:

Machine Learning Models:

Deep Learning Models:

📈 Results:

🛠️ Web Application Interface

🌟 Conclusion:

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages