An AI-powered medical consultation assistant that combines voice recognition, natural language processing, and retrieval-augmented generation (RAG) to provide accurate medical information through an intuitive voice interface.

chouket0102/Medical-Assistant

🏥 Medical Voice Assistant

Python FastAPI License RAG

The assistant transcribes a spoken question, retrieves relevant passages from a Pinecone index built from medical PDFs, generates an answer with an LLM, and speaks the reply back over the same WebSocket connection.

🌟 Features

  • 🎤 Voice-to-Voice Interaction: Real-time speech recognition and text-to-speech responses
  • 🔍 RAG-Powered Responses: Retrieval-Augmented Generation using medical literature
  • 📚 Medical Knowledge Base: Built on comprehensive medical PDFs and documentation
  • 🌐 Modern Web Interface: Clean, responsive UI with real-time status updates
  • ⚡ WebSocket Communication: Low-latency real-time audio streaming
  • 🎯 High Accuracy: Answers are grounded in passages retrieved from a Pinecone vector database using Hugging Face embeddings
  • 📱 Mobile Responsive: The layout adapts to phone, tablet, and desktop screens
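
The retrieval step behind the RAG feature can be illustrated with a minimal, dependency-free sketch. It is not the project's code: `embed` is a toy bag-of-words stand-in for the Hugging Face embedding model, `InMemoryIndex` is a hypothetical brute-force replacement for Pinecone, and the sample passages are invented for illustration only.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model: lowercase word counts."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


class InMemoryIndex:
    """Hypothetical stand-in for a vector database (brute-force search)."""

    def __init__(self):
        self._items = []  # (doc_id, chunk_text, vector)

    def upsert(self, doc_id: str, text: str) -> None:
        self._items.append((doc_id, text, embed(text)))

    def query(self, text: str, top_k: int = 2):
        """Return up to top_k (doc_id, chunk) pairs, best match first."""
        q = embed(text)
        scored = [(cosine(q, vec), doc_id, chunk) for doc_id, chunk, vec in self._items]
        scored.sort(key=lambda s: s[0], reverse=True)
        return [(doc_id, chunk) for score, doc_id, chunk in scored[:top_k] if score > 0]


def build_prompt(question: str, passages) -> str:
    """Combine retrieved passages and the question into one LLM prompt."""
    context = "\n".join(f"[{doc_id}] {chunk}" for doc_id, chunk in passages)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )


# Invented sample passages, for illustration only.
index = InMemoryIndex()
index.upsert("doc-1", "Ibuprofen is a nonsteroidal anti-inflammatory drug used for pain and fever.")
index.upsert("doc-2", "Hypertension is persistently elevated blood pressure in the arteries.")
index.upsert("doc-3", "Insulin regulates blood glucose levels.")

question = "What is hypertension?"
prompt = build_prompt(question, index.query(question, top_k=1))
```

In a real deployment the word counts would be replaced by dense vectors from an embedding model and the brute-force loop by a vector database query; the retrieve-then-prompt shape stays the same.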

🚀 Demo

Medical Voice Assistant Demo

Real-time voice interaction with the medical assistant

🏗️ Architecture

graph TB
    A[User Voice Input] --> B[WebSocket Connection]
    B --> C[Speech Recognition]
    C --> D[Medical Agent]
    D --> E[RAG System]
    E --> F[Query Embedding]
    F --> G[HuggingFace Embeddings]
    G --> H[Pinecone Vector DB]
    H --> I[Retrieved Medical Documents]
    I --> J[Context + Query]
    J --> K[LLM Processing]
    K --> L[Generated Response]
    L --> M[Text-to-Speech]
    M --> N[Audio Response]
    N --> B
    B --> O[User Audio Output]
    
    style A fill:#e1f5fe
    style O fill:#e8f5e8
    style H fill:#fff3e0
    style K fill:#f3e5f5
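
As a rough sketch of how one voice turn might flow through the stages in the diagram, the stages can be modeled as injected async callables. All names here (`VoicePipeline`, `handle_turn`, and the `fake_*` stubs) are hypothetical and not taken from the repository; the stubs only pass strings around so the example runs without any speech, LLM, or vector database service.

```python
import asyncio
from dataclasses import dataclass
from typing import Awaitable, Callable, List


@dataclass
class VoicePipeline:
    """One callable per diagram stage; real services would be plugged in here."""

    transcribe: Callable[[bytes], Awaitable[str]]            # Speech Recognition
    retrieve: Callable[[str], Awaitable[List[str]]]          # RAG System / Vector DB
    generate: Callable[[str, List[str]], Awaitable[str]]     # LLM Processing
    synthesize: Callable[[str], Awaitable[bytes]]            # Text-to-Speech

    async def handle_turn(self, audio_in: bytes) -> bytes:
        """Run a single request through the stages in diagram order."""
        query = await self.transcribe(audio_in)
        documents = await self.retrieve(query)
        answer = await self.generate(query, documents)
        return await self.synthesize(answer)


# Stub stages (assumptions for illustration; no real audio or model involved).
async def fake_transcribe(audio: bytes) -> str:
    return audio.decode("utf-8")


async def fake_retrieve(query: str) -> List[str]:
    return [f"passage about {query}"]


async def fake_generate(query: str, documents: List[str]) -> str:
    return f"{len(documents)} passage(s) found for: {query}"


async def fake_synthesize(text: str) -> bytes:
    return text.encode("utf-8")


pipeline = VoicePipeline(fake_transcribe, fake_retrieve, fake_generate, fake_synthesize)
reply = asyncio.run(pipeline.handle_turn(b"what is a fever"))
```

Keeping each stage behind a plain callable means a WebSocket handler only needs to call `handle_turn` with the received audio bytes and send the returned bytes back, and any stage can be swapped or stubbed in tests.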