This repository showcases implementations of hand tracking and volume control using computer vision and voice control techniques. The project leverages OpenCV and MediaPipe for real-time hand tracking and gesture recognition.
This Jupyter Notebook provides real-time hand tracking using the MediaPipe library, including: β Detecting hand landmarks in real-time. β Identifying finger positions and gestures. β Visualizing hand tracking results with OpenCV.
- π OpenCV (
cv2) - π₯ MediaPipe (
mediapipe) - β‘ NumPy (
numpy)
This Notebook extends the hand tracking module to control system volume using hand gestures: β Detecting hand movements. β Calculating finger distances to adjust volume. β Dynamically controlling system volume.
- π
pycaw(for audio control) - π OpenCV
- π₯ MediaPipe
This directory contains various Python scripts and images for testing and implementing different computer vision functionalities:
π PDF File
- π
Demande_reservation formation.pdf: Sample PDF for OCR testing.
π Python Scripts
- π
exercice.py: Implements an exercise related to image processing. - π
hist.py: Computes and displays an image histogram. - πΌ
imread_imshow.py: Reads and displays an image using OpenCV. - π
ocr.py: Performs OCR (Optical Character Recognition) on an image. - π
read.py: Reads and displays an image. - π
resize.py: Resizes an image with transformations. - πΌ
seui.py: Applies thresholding techniques to an image and visualizes results.
πΌ Sample Images
- πΌ
im.webp,image.png: Sample images used in the project.
To run the notebooks, install the required dependencies using:
pip install opencv-python mediapipe numpy pycaw pytesseract pdf2image1οΈβ£ Clone the repository:
git clone https://github.com/Code-Crafters-BM/Voice_Controll_And_Computer_Vision.git
cd Voice_Controll_And_Computer_Vision2οΈβ£ Open Jupyter Notebook: ( Or navigate on your browerser to google collab : https://colab.research.google.com/ )
jupyter notebook3οΈβ£ Run HandTrackingModule.ipynb to test hand tracking.
4οΈβ£ Run VolumeHandcontrol.ipynb to test volume control.
- Code Crafters Bm β Project development and implementation.
- π Inspired by OpenCV and MediaPipe tutorials.
- π₯ Based on computer vision techniques for gesture recognition.