This is a powerful and privacy-conscious medical assistant app that uses both cloud-based and offline AI models to answer user questions via voice or text. The app supports Hindi and English output, uses Claude 3.5 via AWS Bedrock for high-quality online answers, and falls back to BioGPT + offline TTS/STT for offline use.
The app accepts voice or text input and returns responses via voice (TTS) or text.
With an internet connection, it uses:
- 🎙️ OpenAI Whisper (STT)
- 🤖 Claude 3.5 via AWS Bedrock for responses
- 📚 RAG (Retrieval Augmented Generation) with Pinecone
- 🗣️ Amazon Polly for voice output
- 📝 Textract & Translate (for OCR and multilingual support)
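
For the Bedrock path, a minimal sketch of invoking Claude 3.5 with boto3 might look like the following. The region and model ID are assumptions about the deployment, not values taken from this repo:

```python
import json
import boto3

# Assumed region and model ID; adjust to your Bedrock deployment.
bedrock = boto3.client("bedrock-runtime", region_name="us-east-1")

response = bedrock.invoke_model(
    modelId="anthropic.claude-3-5-sonnet-20240620-v1:0",
    body=json.dumps({
        "anthropic_version": "bedrock-2023-05-31",
        "max_tokens": 512,
        "messages": [{"role": "user", "content": "What are the symptoms of anemia?"}],
    }),
)
answer = json.loads(response["body"].read())["content"][0]["text"]
print(answer)
```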
Without internet, it runs completely offline using:
- 🧠 BioGPT (Q5_K_M GGUF) for LLM responses
- 🧾 easyOCR (OCR)
- 🗣️ Glow-TTS (TTS)
- 🧏‍♂️ faster-whisper-base.en (STT)
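
As a rough illustration of the offline LLM path, a BioGPT GGUF checkpoint can be loaded with llama-cpp-python. The model path below is illustrative, not the repo's actual layout:

```python
from llama_cpp import Llama

# Illustrative path; see the model setup section further down.
llm = Llama(model_path="models/biogpt/biogpt.Q5_K_M.gguf", n_ctx=2048)

out = llm(
    "Question: What is hypertension?\nAnswer:",
    max_tokens=128,
    stop=["Question:"],
)
print(out["choices"][0]["text"].strip())
```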
The system:
- Uses chat history as part of the context to improve response quality.
- Saves conversations locally by session.
- Can analyze uploaded files or images (OCR + language detection + summarization).
- Automatically chooses online or offline mode depending on connectivity.
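
One simple way to implement that automatic switch is a lightweight connectivity probe. In this sketch, `answer_online` and `answer_offline` are hypothetical stand-ins for the Bedrock+RAG pipeline and the local BioGPT pipeline; they are not names from this repo:

```python
import socket

def is_online(host: str = "8.8.8.8", port: int = 53, timeout: float = 3.0) -> bool:
    """Probe a public DNS resolver; a failed connection implies offline mode."""
    try:
        socket.create_connection((host, port), timeout=timeout).close()
        return True
    except OSError:
        return False

def answer(question: str, history: list[dict]) -> str:
    # Hypothetical dispatch between the two pipelines, with chat
    # history passed along as context in both modes.
    if is_online():
        return answer_online(question, history)
    return answer_offline(question, history)
```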
| Area | Online Tools | Offline Tools |
|---|---|---|
| LLM | Claude 3.5 (AWS Bedrock) | BioGPT (Q5_K_M, GGUF via llama.cpp) |
| STT | OpenAI Whisper API | faster-whisper-base.en |
| TTS | Amazon Polly | Glow-TTS |
| OCR | AWS Textract | easyOCR |
| RAG | Pinecone (free tier) | Local fallback context only |
| Language | AWS Translate / langdetect | langdetect |
| Chat History | File-based JSON session storage | Same |
| API Backend | Flask + CORS | Flask |
| Frontend | React (voice UI, file input, etc.) | Same |
The vector search backend is populated using the following medical QA datasets:
- **PubMedQA** (`qiaojin/PubMedQA`)
  - Both `pqa_labeled` and `pqa_unlabeled` versions
  - Used the `final_decision`, `context`, and `long_answer` fields
- **MedQuad** (`keivalya/MedQuad-MedicalQnADataset`)
  - Curated question-answer pairs from multiple medical sources

Embeddings are generated with `all-MiniLM-L6-v2` via Hugging Face; sentences are chunked, embedded, and upserted into the Pinecone vector DB.
```yaml
license: apache-2.0
tags:
  - medical
  - RAG
  - offline
  - whisper
  - bedrock
  - biogpt
  - TTS
  - STT
  - Hindi
datasets:
  - qiaojin/PubMedQA
  - keivalya/MedQuad-MedicalQnADataset
language:
  - en
  - hi
model:
  - BioGPT Q5_K_M GGUF
  - Claude 3.5 (Bedrock)
  - all-MiniLM-L6-v2 (embeddings)
stt:
  - OpenAI Whisper API
  - faster-whisper-base.en
tts:
  - Amazon Polly
  - Glow-TTS
ocr:
  - AWS Textract
  - easyOCR
vector_store:
  - Pinecone (free-tier)
api:
  - Flask (backend)
  - React (frontend)
```
```bash
git clone https://github.com/PranavShashidhara/MediAssist_AI.git
cd MediAssist_AI/backend
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
pip install -r requirements.txt
```
Create a `.env` file in the `backend/` directory with the following content:
```
PINECONE_API_KEY=your-pinecone-key
OPENAI_API_KEY=your-openai-key
HF_TOKEN=your-hugging-face-key
```
💡 For offline mode, you can leave API keys blank — the system will switch to local models automatically.
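
One plausible way that fallback can work, shown here as an assumption about the app's internals rather than its actual logic, is to load the `.env` with python-dotenv and treat blank keys as a signal to stay local:

```python
import os
from dotenv import load_dotenv

load_dotenv()  # reads backend/.env

# Assumption: missing or blank keys flip the app into offline mode.
ONLINE_KEYS = ("PINECONE_API_KEY", "OPENAI_API_KEY")
CLOUD_ENABLED = all(os.getenv(k) for k in ONLINE_KEYS)
print("mode:", "online" if CLOUD_ENABLED else "offline")
```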
Ensure the following are downloaded and placed correctly:
- 🧠 BioGPT Q5_K_M GGUF in a folder like `models/biogpt`
- 🧏‍♂️ faster-whisper-base.en (STT model)
- 🗣️ Glow-TTS for offline text-to-speech
These models may auto-download on first use, but pre-downloading improves performance.
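
For reference, loading the offline STT model with faster-whisper looks roughly like this; the audio filename is illustrative:

```python
from faster_whisper import WhisperModel

# "base.en" downloads automatically on first use if not cached locally.
stt = WhisperModel("base.en", device="cpu", compute_type="int8")

segments, info = stt.transcribe("question.wav")  # illustrative filename
text = " ".join(seg.text.strip() for seg in segments)
print(text)
```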
```bash
cd aws_medical_llm
python main.py
```
This starts the Flask backend on http://localhost:5000.
```bash
cd my-chat-app
npm install
npm start
```
This runs the React app at http://localhost:3000.
- Open the app in your browser.
- With internet: ask questions via voice or text in English or Hindi.
- Without internet: disconnect Wi-Fi, and the app automatically switches to offline mode.
- Upload medical images or prescriptions for OCR + question-answering.
- All sessions are stored locally for review or debugging.
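
Session persistence can be as simple as one JSON file per session. The sketch below is written under that assumption; the `sessions/` directory and record layout are hypothetical, not the repo's exact format:

```python
import json
import time
from pathlib import Path

SESSION_DIR = Path("sessions")  # hypothetical storage location
SESSION_DIR.mkdir(exist_ok=True)

def save_turn(session_id: str, role: str, text: str) -> None:
    """Append one chat turn to the session's JSON history file."""
    path = SESSION_DIR / f"{session_id}.json"
    history = json.loads(path.read_text()) if path.exists() else []
    history.append({"role": role, "text": text, "ts": time.time()})
    path.write_text(json.dumps(history, indent=2))
```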
- AWS Bedrock
- Hugging Face
- Pinecone
- Microsoft BioGPT
- Glow-TTS
- Faster-Whisper
- Textract