SmoothTalk

A sign language interpreter that translates a video signal into the corresponding word. This machine-learning project, powered by PyTorch, aims to develop a sign language recognition system.

⚠️ This project is the result of group work done for school under a deadline, and we chose to leave it as it was at the time of submission. It was developed in collaboration with Sia Partners.

Project date: 2024

Summary

Summary
Dependencies
Training
Testing
Streamlit
GPU management
Utils folder
Tests
Credits

Dependencies

Training

Download a dataset of images. The appropriate number of images remains to be defined; in our case it was around 150,000. Many datasets are available on Kaggle. The dataset must follow the format of Dataset/T3_Echantillon.zip. The Python scripts in utils can help you manage datasets; see the Utils folder section for more information
Then, in the model_trainer.ipynb file, specify the path to the dataset
Set up CUDA, as explained in the GPU management section, to train on the GPU and speed up processing
Run the model_trainer.ipynb file
Use the model_trainer_colab.ipynb version if you want to work in the Google Colab interface
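
Before starting a long training run, it can help to check how many images each class contains. The sketch below is not part of the repository. It assumes the dataset uses one sub-folder per class (for example one folder per letter), which is an assumption about the layout in Dataset/T3_Echantillon.zip, and the function name and extension list are hypothetical.

```python
from pathlib import Path

# Assumption: these are the image types used in the dataset.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png", ".bmp"}


def count_images_per_class(dataset_root):
    """Return {class_folder_name: image_count}.

    Assumes one sub-folder per class directly under dataset_root.
    """
    root = Path(dataset_root)
    counts = {}
    for class_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        counts[class_dir.name] = sum(
            1
            for f in class_dir.iterdir()
            if f.is_file() and f.suffix.lower() in IMAGE_EXTENSIONS
        )
    return counts
```

Calling `count_images_per_class("path/to/dataset")` (a placeholder path) returns a dictionary that makes empty or strongly unbalanced classes easy to spot.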

Testing

In real_time_translation.py, specify the path to the .pth file generated by training
Run the real_time_translation.py file to load the model and use your webcam

Streamlit

Streamlit is an open source framework that enables you to easily create interactive web applications for machine learning and data science, simply using Python code.

This lets you run the real_time_translation.py code outside your IDE, with a proper interface.

Install the streamlit library
```
$ pip install streamlit
```
Place the files streamlit_app.py and recognition_model_streamlit.py in the same project
Run the streamlit_app.py file with the command
```
$ streamlit run streamlit_app.py
```

A Streamlit web page then opens, letting you use the previously trained model.

GPU management

Google Colab - Tensorflow or PyTorch

For this project, Google Colab is a very useful tool for processing data in our machine-learning pipeline.
Colab provides a GPU, which lets us process our data faster and more simply.

Usage

Import the code (Jupyter file) into Colab
Upload a dataset in zip format to the Google Drive associated with the Google account used on Colab
Change the path in the code and log in
Respect the tree structure
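
The unzip step above can be sketched as follows. This is a minimal illustration, not the notebook's actual code: the function name and the example paths are hypothetical. In Colab, Google Drive is usually mounted first with `from google.colab import drive` followed by `drive.mount("/content/drive")`, which only works inside a Colab session and is therefore left out of the block.

```python
import zipfile
from pathlib import Path


def extract_dataset(zip_path, dest_dir):
    """Extract a dataset archive and return the top-level entries it created."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(dest)
    # Listing the top level makes it easy to verify the tree structure.
    return sorted(p.name for p in dest.iterdir())
```

In a Colab session this might be called as `extract_dataset("/content/drive/MyDrive/dataset.zip", "/content/dataset")`, where both paths are placeholders to adapt to your own Drive.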

CUDA Toolkit - PyTorch

An alternative to Google Colab is the CUDA Toolkit, which PyTorch uses to run on NVIDIA GPUs. With it, you can use your own computer's GPU. The advantage is that you are not limited by the free tier of Google Colab.

Usage

Python - We recommend installing version 3.10.7
CUDA Toolkit - Install version 11.7.0 (exe installer)
PyTorch - Version 1.13.1. Install the build that matches your CUDA version, as listed on the PyTorch website

Command to enter in cmd

$ pip install torch==1.13.1+cu117 torchvision==0.14.1+cu117 torchaudio==0.13.1 --extra-index-url https://download.pytorch.org/whl/cu117

Installation check: open cmd, start python, run import torch and then torch.cuda.device_count(). On a machine with a single GPU, it should return 1.
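
The same check can be run as a small script. This is a sketch with a hypothetical helper name; it uses only `torch.cuda.is_available()`, `torch.cuda.device_count()` and `torch.cuda.get_device_name()`, and it reports a missing PyTorch install instead of crashing.

```python
import importlib.util


def cuda_report():
    """Return a one-line summary of the PyTorch/CUDA setup."""
    if importlib.util.find_spec("torch") is None:
        return "PyTorch is not installed"
    import torch

    if not torch.cuda.is_available():
        return f"PyTorch {torch.__version__}: no CUDA device visible"
    count = torch.cuda.device_count()
    name = torch.cuda.get_device_name(0)
    return f"PyTorch {torch.__version__}: {count} CUDA device(s), first is {name}"


print(cuda_report())
```

If the report says that no CUDA device is visible, the installed PyTorch build probably does not match the installed CUDA Toolkit version.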

Utils folder

Data_Augmentation.py - Augment dataset data/images (blur, flip, contrast).
Delete_Dataset_Images.py - Delete a given number of images from a dataset.
EchantillonDataset.py - Export a dataset sample to reduce its size.
Generer_Dataset.py - Interface for filling a dataset folder with images of the letters of the alphabet, captured from the webcam. Press "c" to capture an image and "n" to move to the next letter.
data_augment_V2.py - Augment dataset images (blur, flip, contrast), with an independent choice of the number of images per class.
merge_dataset_v2.py - Place images from different datasets in a single folder, respecting the tree structure, with an independent choice of the number of images per class.
merge_different_dataset.py - Place images from different datasets in a single folder, respecting the tree structure.
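
To illustrate the kind of operation these scripts perform, here is a minimal sketch of exporting a reduced sample of a dataset. It is not the code of EchantillonDataset.py: the function name and parameters are hypothetical, and it assumes one sub-folder per class.

```python
import random
import shutil
from pathlib import Path


def sample_dataset(src_root, dst_root, per_class, seed=0):
    """Copy at most per_class randomly chosen files from each class folder.

    Returns {class_folder_name: number_of_files_copied}.
    """
    rng = random.Random(seed)  # fixed seed so the sample is reproducible
    copied = {}
    for class_dir in sorted(p for p in Path(src_root).iterdir() if p.is_dir()):
        files = sorted(f for f in class_dir.iterdir() if f.is_file())
        chosen = rng.sample(files, min(per_class, len(files)))
        target = Path(dst_root) / class_dir.name
        target.mkdir(parents=True, exist_ok=True)  # keep the tree structure
        for f in chosen:
            shutil.copy2(f, target / f.name)
        copied[class_dir.name] = len(chosen)
    return copied
```

The source dataset is left untouched, and classes with fewer files than `per_class` are copied in full.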

Tests

Streamlit Interface

Letter T

Letter C

Credits

Lorenzo : Co-creator of the project.
Mathéo : Co-creator of the project.
Clement Auray : Co-creator of the project.
Evann Ali-Yahia : Co-creator of the project.
Thomas : Co-creator of the project.
