
Final Year Project : Multimodal AI Framework for Social Media Based Mental Disorder Detection and Personalized Wellbeing Insights


arkapg211002/MAFSMBMDDPWI-FYP-2025


Multimodal AI Framework for Social Media Based Mental Disorder Detection and Personalized Wellbeing Insights

Demo


Abstract

This project leverages artificial intelligence to detect early signs of mental health issues by analyzing multimodal data from social media platforms. By processing text, images, videos, audio, and PDFs, the system uses advanced machine learning and deep learning models to classify mental disorders such as depression, anxiety, bipolar disorder, and PTSD. The framework also generates personalized wellbeing insights by mapping mental health cues to Ryff’s Wellbeing Scale. With an ensemble of algorithms—including Logistic Regression, SVM, Naïve Bayes, LSTM, Transformer, and XGBoost—the system achieves high accuracy and robustness while enabling real-time web-based analysis via Streamlit.


Features

  • 🔍 Multimodal Analysis: Processes text, images, videos, audio, and PDFs.
  • 📊 Data-Driven Insights: Uses ensemble learning for robust classification.
  • ⚙️ Advanced Preprocessing: Cleans and tokenizes raw social media data.
  • 🧠 Diverse ML Models: Implements Logistic Regression, Naïve Bayes, SVM, Random Forest, and XGBoost.
  • 🤖 Deep Learning Models: LSTM-based and Transformer-based models for sequential and contextual data.
  • 🌐 Web Application: Deployable via Streamlit for real-time analysis.
  • 📈 Performance Evaluation: Extensive testing with cross-validation, ROC curves, and confusion matrices.
  • 📋 Wellbeing Mapping: Uses an association matrix and a brief wellbeing survey to map predictions to Ryff’s Psychological Wellbeing dimensions. It highlights areas (e.g., purpose, autonomy) users should focus on.
  • 🧠 AI-Powered Wellbeing Insights: Uses Gemini API to generate personalized wellbeing insights based on predictions.
  • 🔁 RAG-Style Insights: Employs Retrieval-Augmented Generation using dynamic datasets and similarity-based input matching—with Gemini—for scalable and explainable wellbeing analysis.
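The association-matrix mapping can be illustrated with a small sketch. The dimension weights, the 0–1 survey scale, and the threshold below are made-up placeholders for illustration, not the project's actual matrix:

```python
# Illustrative sketch of mapping a predicted disorder to Ryff dimensions.
# All weights and the threshold are hypothetical placeholder values.

RYFF_DIMENSIONS = ["autonomy", "environmental_mastery", "personal_growth",
                   "positive_relations", "purpose_in_life", "self_acceptance"]

# Hypothetical association strengths (0..1) between disorders and dimensions.
ASSOCIATION = {
    "depression": {"purpose_in_life": 0.9, "self_acceptance": 0.8, "personal_growth": 0.6},
    "anxiety":    {"environmental_mastery": 0.8, "autonomy": 0.6},
}

def focus_areas(predicted_disorder, survey_scores, threshold=0.5):
    """Weight each survey deficit (1 - normalized score) by the disorder's
    association strength; flag dimensions whose product exceeds the threshold."""
    weights = ASSOCIATION.get(predicted_disorder, {})
    flagged = []
    for dim in RYFF_DIMENSIONS:
        deficit = 1.0 - survey_scores.get(dim, 1.0)  # missing answer = satisfied
        if weights.get(dim, 0.0) * deficit >= threshold:
            flagged.append(dim)
    return flagged

survey = {"purpose_in_life": 0.2, "self_acceptance": 0.5, "autonomy": 0.9}
print(focus_areas("depression", survey))  # ['purpose_in_life']
```

The flagged dimensions (here, purpose in life) are the "areas to focus on" that the wellbeing report surfaces to the user.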

Tech Stack and Libraries

  • Programming Language: Python 3.x
  • Frameworks & Environments:
    • TensorFlow & Keras
    • Streamlit
    • Google Colab (for GPU-based training)
  • Key Libraries:
    • Data Processing: pandas, numpy
    • Machine Learning: scikit-learn, xgboost
    • Deep Learning: tensorflow, keras, transformers (Hugging Face)
    • NLP: nltk, tiktoken
    • Image & Video Processing: opencv-python, pytesseract, deepface, ffmpeg
    • Audio Processing: librosa, pydub, SpeechRecognition
    • Social Media Integration: PRAW (Reddit), tweepy (Twitter)
    • Visualization: plotly, matplotlib
    • Other Tools: joblib, protobuf, deep-translator, google-generativeai

Streamlit app


How to run?

⚠️ Note: To create a Reddit API Client ID and Secret Key, visit
https://old.reddit.com/prefs/apps/
and follow the steps below.

Step 1:

Step 2:

Step 3:


📌 Note: To use Google AI Studio features, create a Google API Key
👉 Visit https://aistudio.google.com/apikey and create an API key.
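With the key in hand, a wellbeing insight can be requested through the google-generativeai client along these lines. The prompt wording and the model name (`gemini-1.5-flash`) are illustrative assumptions:

```python
# Hedged sketch: generate a personalized insight with google-generativeai
# once the API key from AI Studio is available.
def wellbeing_insight(api_key, prediction, focus_dimensions):
    """Ask Gemini for suggestions tied to the prediction and Ryff dimensions."""
    import google.generativeai as genai  # lazy import; needs a valid key to run

    genai.configure(api_key=api_key)
    model = genai.GenerativeModel("gemini-1.5-flash")
    prompt = (
        f"Predicted condition: {prediction}. "
        f"Ryff wellbeing dimensions to focus on: {', '.join(focus_dimensions)}. "
        "Suggest three gentle, non-clinical wellbeing activities."
    )
    return model.generate_content(prompt).text
```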


🚨 Note: To access Twitter (X) API features, you must create a Bearer Token

👉 First, create a profile on X
👉 Then open https://developer.x.com/en/portal/dashboard in a new tab and follow the steps below.

Step 1:

Step 2: GENERATE BEARER TOKEN AND SAVE IT


🔐 Note: To use Ngrok for tunneling your application online:

  • Login using your GitHub account at https://ngrok.com/
  • Then generate your Ngrok Auth Token and Static Domain using the steps below.

Step 1: Generate Auth Token

Step 2: Create Static Domain
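In a Colab notebook, the token and static domain are typically wired up with pyngrok. This is an assumption, as the notebook may instead drive the ngrok binary directly:

```python
# Hedged sketch using pyngrok (an assumption): expose the local Streamlit
# port on the reserved static domain.
def open_tunnel(auth_token, static_domain, port=8501):
    """Tunnel the local Streamlit port and return the public URL."""
    from pyngrok import ngrok  # lazy import

    ngrok.set_auth_token(auth_token)
    tunnel = ngrok.connect(port, domain=static_domain)  # ngrok v3 "domain" option
    return tunnel.public_url
```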


  • Unzip the Pickles and Required Files and Samples archives.
  • Upload WebAppV13-7.ipynb to Google Drive and open it in Google Colab.
  • Replace the API keys for Reddit, Gemini, Twitter, and Ngrok.

⚠️ Warning: The cross_encoder_gpu.pkl file inside the "Pickles and Required Files" may become incompatible if Google Colab updates its environment.

It is recommended to regenerate it using the following code:

from sentence_transformers import CrossEncoder
import pickle
import torch

# Load the cross-encoder model; fall back to CPU when no GPU runtime is attached
device = "cuda" if torch.cuda.is_available() else "cpu"
cross_encoder_model = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2", device=device)

# Save it to a pickle file
with open("cross_encoder_gpu.pkl", "wb") as f:
    pickle.dump(cross_encoder_model, f)

print("✅ Saved cross_encoder_model to pkl")

✅ Replace the cross_encoder_gpu.pkl inside the Pickles and Required Files folder with the newly generated file to ensure compatibility.
Then disconnect and delete the Colab runtime, and reconnect before continuing.

Starting the application

Test with sample inputs obtained after unzipping Samples.zip

📄 View Reports Folder


⚠️ Warning
Due to regular updates to Google Colab, the LSTM- or Transformer-based pickle files may occasionally fail to load. It is recommended to rerun the model notebooks, create new pickle files, and retrain Ensemble Model 7 (base models: LR, SVM, XGBoost, Naïve Bayes, LSTM, Transformer; meta learner: Random Forest) before running the web application.
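The retraining step can be sketched with scikit-learn's StackingClassifier on synthetic data. The real Ensemble Model 7 also stacks XGBoost, LSTM, and Transformer outputs, which are omitted here to keep the sketch dependency-light:

```python
# Illustrative stacking setup on synthetic data; hyperparameters are defaults,
# not the project's tuned values.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.naive_bayes import GaussianNB
from sklearn.svm import SVC

X, y = make_classification(n_samples=300, n_features=20, random_state=0)

stack = StackingClassifier(
    estimators=[
        ("lr", LogisticRegression(max_iter=1000)),
        ("svm", SVC(probability=True)),   # probability=True so the meta learner
        ("nb", GaussianNB()),             # sees class probabilities, not labels
    ],
    final_estimator=RandomForestClassifier(random_state=0),  # meta learner
    cv=3,  # out-of-fold predictions feed the meta learner
)
stack.fit(X, y)
print(round(stack.score(X, y), 2))
```

Once the freshly trained stack (plus the deep models) is re-pickled, the web application can load the new files in place of the stale ones.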
