MindSight is an AI-powered application that combines facial emotion recognition with standardized psychological assessments to provide comprehensive mental health insights. Built on a deep learning architecture that pairs an EfficientNet-B0 backbone with Transformer layers, the system detects and analyzes seven basic emotions in real time: anger, disgust, fear, happiness, neutral, sadness, and surprise.
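To make that pairing concrete, here is a minimal sketch using standard torchvision and PyTorch components; the head count, depth, and pooling choices are illustrative assumptions, not the project's exact configuration:

```python
import torch.nn as nn
from torchvision import models

EMOTIONS = ["anger", "disgust", "fear", "happiness", "neutral", "sadness", "surprise"]

class EmotionNet(nn.Module):
    """Sketch: EfficientNet-B0 features feeding a Transformer encoder and a 7-class head."""

    def __init__(self, num_classes=len(EMOTIONS), d_model=1280, nhead=4, num_layers=2):
        super().__init__()
        backbone = models.efficientnet_b0(weights="IMAGENET1K_V1")
        self.features = backbone.features  # outputs (B, 1280, 7, 7) for 224x224 input
        layer = nn.TransformerEncoderLayer(d_model=d_model, nhead=nhead, batch_first=True)
        self.transformer = nn.TransformerEncoder(layer, num_layers=num_layers)
        self.head = nn.Linear(d_model, num_classes)

    def forward(self, x):
        f = self.features(x)                   # (B, C, H, W) convolutional features
        tokens = f.flatten(2).transpose(1, 2)  # (B, H*W, C): spatial positions as tokens
        tokens = self.transformer(tokens)      # global attention across facial regions
        return self.head(tokens.mean(dim=1))   # mean-pool tokens, then classify
```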
- Real-time Emotion Detection: Process webcam feed to identify facial expressions and emotional states
- Mental Health Assessment: Integrates emotional analysis with standardized psychological questionnaires, including PHQ-9 (depression) and GAD-7 (anxiety); a scoring sketch follows this list
- Clinical Dashboard: Professional UI designed for healthcare practitioners
- Comprehensive Reporting: Generates detailed assessment reports with visualizations
- Containerized Deployment: Easily deploy with Docker in various environments
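For context on the questionnaire side, the PHQ-9 sums nine item scores (each 0-3) and maps the total to its standard severity bands. The sketch below shows that standard scoring logic; it is illustrative and not necessarily how `questionnaire.py` implements it:

```python
def score_phq9(answers: list[int]) -> tuple[int, str]:
    """Score a PHQ-9 questionnaire: nine items, each rated 0-3 (total 0-27)."""
    if len(answers) != 9 or not all(0 <= a <= 3 for a in answers):
        raise ValueError("PHQ-9 expects nine item scores in the range 0-3")
    total = sum(answers)
    # Standard PHQ-9 severity bands
    if total <= 4:
        severity = "minimal"
    elif total <= 9:
        severity = "mild"
    elif total <= 14:
        severity = "moderate"
    elif total <= 19:
        severity = "moderately severe"
    else:
        severity = "severe"
    return total, severity

print(score_phq9([1, 2, 1, 0, 2, 1, 1, 0, 1]))  # -> (9, 'mild')
```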
| Screenshot | Description |
|---|---|
| ![]() | The clinical dashboard provides an intuitive interface for practitioners |
| ![]() | The assessment interface combines real-time emotion recognition with psychological questionnaires |
- Base Model (`model.py`): EfficientNet-B0 backbone with custom Transformer blocks
- Improved Model (`model_2.py`): Enhanced architecture with:
  - Multi-scale feature extraction using a feature pyramid (3-layer extraction)
  - Spatial attention mechanism to focus on relevant facial features
  - Enhanced transformer with 8 attention heads and 3 transformer blocks (increased from 4 heads and 2 blocks)
  - Deeper MLP head with GELU activation and dropout regularization (0.2 dropout rate)
  - Focal Loss implementation (α=1, γ=2) for handling class imbalance; see the sketch below
- Training: Cross-validation, data augmentation, class oversampling, and focal loss
- Performance: Currently ~67% accuracy on the 7-class emotion classification task; we are still refining the training pipeline to improve this
- Inference: Real-time processing with support for CPU and GPU/MPS hardware acceleration
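The focal loss mentioned above down-weights easy, well-classified examples so training focuses on hard ones. A minimal implementation matching the α=1, γ=2 configuration (a standard formulation shown for illustration; the version in `train.py` may differ):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class FocalLoss(nn.Module):
    """Focal loss: FL(p_t) = -alpha * (1 - p_t)^gamma * log(p_t)."""

    def __init__(self, alpha=1.0, gamma=2.0):
        super().__init__()
        self.alpha, self.gamma = alpha, gamma

    def forward(self, logits, targets):
        ce = F.cross_entropy(logits, targets, reduction="none")  # per-sample -log(p_t)
        pt = torch.exp(-ce)                                      # recover p_t
        return (self.alpha * (1 - pt) ** self.gamma * ce).mean()

# Usage: criterion = FocalLoss(); loss = criterion(model(images), labels)
```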
- Frontend: Streamlit for interactive web interface
- Backend: Python 3.9+, PyTorch 2.0+
- Computer Vision: OpenCV for real-time video processing
- Containerization: Docker for consistent deployment
- Visualization: Matplotlib, Seaborn, Plotly for data visualization
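Because inference runs on CPU, CUDA GPUs, or Apple-silicon MPS (noted above), a typical device-selection helper looks like the following; this is a generic sketch, not the project's exact code:

```python
import torch

def pick_device() -> torch.device:
    """Prefer a CUDA GPU, then Apple MPS, then fall back to CPU."""
    if torch.cuda.is_available():
        return torch.device("cuda")
    if torch.backends.mps.is_available():
        return torch.device("mps")
    return torch.device("cpu")

# Usage: model.to(pick_device())
```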
```bash
# Clone the repository
git clone https://github.com/p1sangmas/MindSight.git
cd MindSight

# Start the application with Docker
./start.sh

# Access the dashboard at http://localhost:8501
```
```bash
# Clone the repository
git clone https://github.com/p1sangmas/MindSight.git
cd MindSight

# Create and activate virtual environment
python -m venv env
source env/bin/activate  # On Windows: env\Scripts\activate

# Install dependencies
pip install -r requirements.txt

# Run the application
streamlit run src/dashboard_app.py
```
For optimal webcam performance, please note:
- OpenCV is used for webcam capture
- When running in Docker, camera device mapping must be properly configured
- Browser permissions must be granted for camera access
- See `webcam-setup.md` for detailed instructions for your specific OS
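As a quick sanity check that OpenCV can reach your camera before launching the dashboard, you can run a minimal capture loop like this (device index 0 is an assumption; adjust for your setup):

```python
import cv2

cap = cv2.VideoCapture(0)  # device index 0; change if you have multiple cameras
if not cap.isOpened():
    raise RuntimeError("Cannot open webcam; check permissions and device mapping")

while True:
    ok, frame = cap.read()
    if not ok:
        break
    cv2.imshow("Webcam check", frame)
    if cv2.waitKey(1) & 0xFF == ord("q"):  # press 'q' to quit
        break

cap.release()
cv2.destroyAllWindows()
```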
The project includes scripts for training custom emotion recognition models. You can train either the base model or the improved model with various configurations:
```bash
# Train the base model with default parameters
python src/train.py --data_dir data --model_version original

# Train the improved model with focal loss
python src/train.py --data_dir data --model_version improved --focal_loss --model_folder my_custom_model

# Train the improved model with class weights
python src/train.py --data_dir data --model_version improved --class_weights 1.5 2.0 1.5 0.8 1.2 1.5 1.0 --model_folder my_custom_model

# Train with custom configurations (e.g., for 50 epochs)
python src/train.py --data_dir data --model_version improved --focal_loss --model_folder my_custom_model --num_epochs 50 --batch_size 32

# Evaluate a trained model
python src/evaluate.py --model_path checkpoints/model_name/best_model.pth
```
Each checkpoint directory contains:

- `best_model.pth`: The trained model weights
- `classification_report.txt`: Detailed metrics including precision, recall, and F1-score
- `confusion_matrix.png`: Visualization of model performance across emotion classes
- `train_log.txt`: Complete training history
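To load saved weights for inference, something like the following works; the `ImprovedEmotionModel` class name is an assumption for illustration, so use whatever class `model_2.py` actually exports:

```python
import torch
from src.model_2 import ImprovedEmotionModel  # assumed class name; check model_2.py

model = ImprovedEmotionModel()
state = torch.load("checkpoints/model_name/best_model.pth", map_location="cpu")
# If the checkpoint wraps the weights (e.g. {"model_state_dict": ...}), unwrap first
model.load_state_dict(state)
model.eval()  # disable dropout for inference
```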
```
├── src/                       # Source code
│   ├── model.py               # Base model architecture definition
│   ├── model_2.py             # Improved model with advanced features
│   ├── train.py               # Training pipeline
│   ├── evaluate.py            # Evaluation script
│   ├── dashboard_app.py       # Streamlit dashboard application
│   ├── dashboard_utils.py     # Utilities for the dashboard
│   ├── questionnaire.py       # Psychological assessment questionnaires
│   └── data_preprocessing.py  # Data preprocessing utilities
├── data/                      # Training and testing datasets
│   ├── train/                 # Training images organized by emotion classes
│   └── test/                  # Test images organized by emotion classes
├── checkpoints/               # Saved model weights for various experiments
│   └── model_name/            # Trained model
├── runs/                      # TensorBoard logs for training monitoring
├── assets/                    # Sample images and screenshots
└── docker-compose.yml         # Docker configuration
```
If you're having issues with camera access:

1. Browser Permissions:
   - Check that your browser has permission to access your camera
   - Try using Chrome or Firefox (they have better webcam support)
   - Look for the camera icon in your browser's address bar
2. Docker-specific issues:
   - Browser-based camera access requires explicit permission
   - If the permission dialog doesn't appear, try clicking the "Request Camera Permission" button
   - See the Docker configuration in `docker-compose.yml` for your specific OS
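If no camera responds at all, the device index may be the problem (common under Docker). A small probe like this lists which indices OpenCV can open; it is an illustrative helper, not part of the repository:

```python
import cv2

def list_cameras(max_index: int = 5) -> list[int]:
    """Return the camera indices that OpenCV can open."""
    found = []
    for i in range(max_index):
        cap = cv2.VideoCapture(i)
        if cap.isOpened():
            found.append(i)
        cap.release()
    return found

print("Available camera indices:", list_cameras())
```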
This project is licensed under the MIT License. See the LICENSE file for details.
Developed by Fakhrul Fauzi, Zikry Zaharudin and Saiful Azree