AudiHealth is a machine learning-based application designed to detect voice pathologies, specifically vocal polyp and laryngitis, using audio data. Leveraging the power of VGGish for feature extraction and a Liquid State Machine (LSM) model for classification, AudiHealth aims to provide early detection capabilities, potentially improving patient outcomes.
- Introduction
- Features
- Getting Started
- Prerequisites
- Installation
- Usage
- Contributing
- License
- Contact
- Acknowledgments
Voice pathologies, such as vocal polyp and laryngitis, can significantly impact a person's quality of life. Early detection is crucial, but it often faces challenges due to the complexity of voice analysis. AudiHealth addresses this by utilizing advanced machine learning techniques to analyze audio data and detect these pathologies with high accuracy.
- VGGish Feature Extraction: Utilizes the VGGish model for extracting rich audio features.
- Liquid State Machine (LSM) Model: A custom model architecture for pathology detection.
- Data Preprocessing: Includes SMOTE for handling class imbalance.
- Performance Metrics: Provides detailed performance metrics for evaluation.
These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.
- Python 3.7+
- TensorFlow 2.x
- Librosa
- TensorFlow Hub
- Clone the repository:
- Navigate to the project directory:
- Install the required packages:
To use AudiHealth for voice pathology detection, follow these steps:
- Prepare your audio data in the required format(.wav).
- Run the feature extraction script:
- Train the model using the extracted features:
- Use the trained model to predict voice pathologies:
Contributions are welcome! Please read the CONTRIBUTING.md for details on how to contribute to this project.
This project is licensed under the MIT License - see the LICENSE.md file for details.
- Krishnan Vignesh - krishnanvignesh148@gmail.com
- Sahil Brid - bridsahil007@gmail.com
Project Link: https://github.com/Krish0909/AudiHealth
- VGGish for feature extraction.
- TensorFlow Hub for the LSM model.
- All contributors to this project.