# ClinicalGPT Medical Assistant

A sophisticated medical assistant application that combines large language models with trusted medical sources to provide accurate medical information and analysis.
- Features
- System Architecture
- Quick Start
  - Prerequisites
  - Installation
  - Using the Application
- Configuration
  - Environment Variables
  - Trusted Domains
- Key Components
  - Server (`server/`)
  - Utils (`utils/`)
- API Reference
  - Endpoints
  - Sample Request
- Security
- Contributing
- License
- Acknowledgments
## Features

- Intelligent Medical Queries: Get accurate responses to medical questions using state-of-the-art language models
- Web Search Integration: Automatic search and validation from trusted medical sources
- File Analysis: Process medical documents including:
  - Text files (`.txt`)
  - CSV data files
  - JSON documents
  - Medical images
  - PDF documents
- Modern Web Interface: Responsive design with real-time feedback
- History Management: Track and review past queries and analyses
- Multi-Device Support: Intelligent hardware acceleration on:
  - NVIDIA GPUs (CUDA)
  - AMD GPUs (ROCm)
  - Apple Silicon (MPS)
  - Intel NPUs
  - CPU fallback
- Medical Disclaimer: Transparent communication about AI limitations and the informational nature of responses
## System Architecture

```mermaid
graph TB
    %% Client Layer
    subgraph "Client Layer"
        WI[Web Interface]
        APIC[API Clients]
    end

    %% API Layer
    subgraph "API Layer"
        API[API Endpoints]
        HE[Health Endpoints]
        QE[Query Endpoints]
        FE[File Processing Endpoints]
        ME[Medical Term Detection]
    end

    %% Core Services
    subgraph "Core Services"
        QP[Query Processor]
        FP[File Processor]
        WS[Web Search Integration]
    end

    %% Model Layer
    subgraph "Model Management"
        ML[Model Loader]
        IE[Inference Engine]
        subgraph "Distribution Strategies"
            MP[Model Parallelism]
            PP[Pipeline Parallelism]
            LO[Layer Offloading]
        end
    end

    %% Hardware Layer
    subgraph "Hardware Acceleration"
        CUDA[NVIDIA CUDA]
        ROCM[AMD ROCm]
        MPS[Apple Silicon]
        NPU[Intel NPUs]
        CPU[CPU Fallback]
    end

    %% External Services
    subgraph "External Services"
        subgraph "Medical Data Sources"
            MAYO[Mayo Clinic]
            CDC[CDC]
            NIH[NIH]
            WEBMD[WebMD]
            PUBMED[PubMed]
            WHO[World Health Org.]
            REUTERS[Reuters Health]
        end
        OCR[OCR Services]
        PDF[PDF Processing]
    end

    %% Connections - Client to API
    WI --> API
    APIC --> API

    %% API Layer connections
    API --> HE
    API --> QE
    API --> FE
    API --> ME

    %% API to Core Services
    QE --> QP
    FE --> FP
    QP --> WS

    %% Core Services to Model Management
    QP --> ML
    QP --> IE
    FP --> ML
    FP --> IE

    %% Model Management internal connections
    ML --> MP
    ML --> PP
    ML --> LO
    MP --> IE
    PP --> IE
    LO --> IE

    %% Hardware Acceleration
    IE --> CUDA
    IE --> ROCM
    IE --> MPS
    IE --> NPU
    IE --> CPU

    %% External Services connections
    WS --> MAYO
    WS --> CDC
    WS --> NIH
    WS --> WEBMD
    WS --> PUBMED
    WS --> WHO
    WS --> REUTERS
    FP --> OCR
    FP --> PDF
```
```mermaid
sequenceDiagram
    participant User
    participant WebUI as Web Interface
    participant APILayer as API Endpoints
    participant QueryProc as Query Processor
    participant FileProc as File Processor
    participant WebSearch as Web Search Integration
    participant ModelMgmt as Model Management
    participant InfEngine as Inference Engine
    participant DistStrat as Distribution Strategies
    participant HWAccel as Hardware Acceleration
    participant ExtSrc as External Medical Sources

    %% User submits a query
    User->>WebUI: Enters medical query
    WebUI->>APILayer: POST /api/query
    APILayer->>QueryProc: Process query request

    %% Web search if enabled
    alt Web search enabled
        QueryProc->>WebSearch: Search for medical information
        WebSearch->>ExtSrc: Query trusted medical websites
        ExtSrc-->>WebSearch: Return medical information
        WebSearch-->>QueryProc: Return search results
        QueryProc->>QueryProc: Enhance prompt with web results
    end

    %% Model processing
    QueryProc->>ModelMgmt: Request model inference
    ModelMgmt->>DistStrat: Apply distribution strategy

    %% Choose appropriate hardware acceleration
    alt NVIDIA GPU Available
        DistStrat->>HWAccel: Use CUDA acceleration
    else AMD GPU Available
        DistStrat->>HWAccel: Use ROCm acceleration
    else Apple Silicon
        DistStrat->>HWAccel: Use MPS acceleration
    else Intel NPU
        DistStrat->>HWAccel: Use Intel NPU acceleration
    else
        DistStrat->>HWAccel: Use CPU fallback
    end

    %% Inference process
    HWAccel-->>InfEngine: Hardware-accelerated processing
    InfEngine-->>ModelMgmt: Return model response
    ModelMgmt-->>QueryProc: Return formatted response

    %% Combine results
    QueryProc-->>APILayer: Return combined results
    APILayer-->>WebUI: Return JSON response
    WebUI->>WebUI: Format response with Markdown
    WebUI->>WebUI: Apply medical term highlighting
    WebUI-->>User: Display formatted response

    %% Alternative flow for file upload
    rect rgb(71, 73, 73)
        Note over User,WebUI: File Upload Flow
        User->>WebUI: Uploads medical file
        WebUI->>APILayer: POST /api/process-file
        APILayer->>FileProc: Process uploaded file
        alt PDF Document
            FileProc->>FileProc: Extract text and structure
        else Image File
            FileProc->>FileProc: Perform OCR
        else CSV/JSON
            FileProc->>FileProc: Parse data structure
        else Text File
            FileProc->>FileProc: Process plain text
        end
        FileProc->>ModelMgmt: Request file analysis
        ModelMgmt->>InfEngine: Generate analysis
        InfEngine-->>ModelMgmt: Return analysis
        ModelMgmt-->>FileProc: Return analysis results
        FileProc-->>APILayer: Return processed results
        APILayer-->>WebUI: Return JSON response
        WebUI->>WebUI: Format file analysis results
        WebUI-->>User: Display file analysis
    end
```
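The hardware-selection branch in the diagram reduces to a preference chain. A minimal sketch of that chain (pure Python with the availability set passed in; real detection would query APIs such as `torch.cuda.is_available()`, and the backend names here simply mirror the diagram):

```python
# Backends in the order the diagram tries them; illustrative only --
# the project's actual detection logic lives in its model-management code.
PREFERENCE_ORDER = ("cuda", "rocm", "mps", "npu")

def pick_backend(available):
    """Return the first preferred backend present in `available`,
    falling back to the CPU, mirroring the alt-chain above."""
    for backend in PREFERENCE_ORDER:
        if backend in available:
            return backend
    return "cpu"
```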
## Quick Start

### Prerequisites

- Python 3.12 or higher
- PyTorch-compatible hardware (GPU recommended)
- Internet connection for web search features
- Microsoft C++ Build Tools: required on Windows to compile Python packages with C extensions (e.g., some dependencies for advanced file processing). Download them via the Visual Studio Build Tools installer and make sure "C++ build tools" is selected during installation.
### Installation

1. Clone the repository:

   ```shell
   git clone [repository-url]
   cd mastersDegree-finalProject
   ```

2. Run the setup script:

   ```shell
   run.bat
   ```

   The script will:
   - Create a virtual environment
   - Install dependencies
   - Configure PyTorch for your hardware
   - Start the server
   - Open the web interface in your default browser
### Using the Application

Access the web interface at `http://localhost:5000` in your browser; it opens automatically when you start the server with `run.bat`.
## Configuration

### Environment Variables

- `FLASK_DEBUG`: Enable/disable debug mode
- `PORT`: Server port (default: 5000)
- `MODEL_PATH`: Path to the model (default: `HPAI-BSC/Llama3.1-Aloe-Beta-8B`)
- `USE_INTEL_NPU`: Enable Intel NPU acceleration
- `USE_AMD_NPU`: Enable AMD NPU acceleration
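These variables could be read along the following lines (a sketch using the documented defaults; the `"1"` truthy convention and the helper itself are assumptions, not the server's actual implementation):

```python
import os

def load_settings(env=None):
    """Read the documented environment variables with their documented
    defaults. Illustrative only; the server may read them differently."""
    env = os.environ if env is None else env
    return {
        "debug": env.get("FLASK_DEBUG", "0") == "1",        # assumed convention
        "port": int(env.get("PORT", "5000")),
        "model_path": env.get("MODEL_PATH", "HPAI-BSC/Llama3.1-Aloe-Beta-8B"),
        "use_intel_npu": env.get("USE_INTEL_NPU", "0") == "1",
        "use_amd_npu": env.get("USE_AMD_NPU", "0") == "1",
    }
```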
### Trusted Domains

Edit `config.ini` to modify the list of trusted medical sources.
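The file's exact layout isn't shown here; a plausible `config.ini` section and a `configparser`-based reader might look like this (section name, key name, and domain list are hypothetical):

```python
import configparser

# Hypothetical config.ini layout -- the real file's section and key
# names may differ; this only shows one way such a list could be parsed.
SAMPLE_CONFIG = """
[trusted_domains]
domains = mayoclinic.org, cdc.gov, nih.gov, webmd.com, pubmed.ncbi.nlm.nih.gov, who.int
"""

def load_trusted_domains(text):
    """Parse a comma-separated domain list from an INI-style config."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    raw = parser.get("trusted_domains", "domains", fallback="")
    return {d.strip() for d in raw.split(",") if d.strip()}
```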
## Key Components

### Server (`server/`)

- Flask-based REST API
- Model management and inference
- Modular design with Strategy pattern for model distribution
- Support for model parallelism, pipeline parallelism, and partial offloading
- File processing and analysis
- Web search integration
- Web scraping functionality
- Modular architecture with provider-specific implementations
- Trusted domain verification
### Utils (`utils/`)

- File processing utilities
- Support for various document formats
- Medical term extraction and detection
- Text analysis tools
### Design Principles

- Modular Architecture: Components are organized into focused, reusable modules
- Strategy Pattern: Used for model distribution across different hardware setups
- Legacy Support: Backward compatibility layers for evolving interfaces
- Clear Separation of Concerns: Each module handles specific functionality
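As an illustration of the Strategy pattern described above, the distribution strategies could share one interface for placing model layers on devices. A toy sketch (class and method names are invented for this example, not the project's API):

```python
# Toy Strategy-pattern sketch: each strategy decides where layers live.
class DistributionStrategy:
    def place(self, layers, devices):
        raise NotImplementedError

class ModelParallelism(DistributionStrategy):
    """Split layers round-robin across all available devices."""
    def place(self, layers, devices):
        return {layer: devices[i % len(devices)]
                for i, layer in enumerate(layers)}

class LayerOffloading(DistributionStrategy):
    """Keep the first `budget` layers on the accelerator, offload the rest."""
    def __init__(self, budget):
        self.budget = budget  # how many layers fit on the accelerator
    def place(self, layers, devices):
        return {layer: (devices[0] if i < self.budget else "cpu")
                for i, layer in enumerate(layers)}
```

The benefit is that callers hold a `DistributionStrategy` and never branch on hardware themselves; swapping strategies is a one-line change.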
## API Reference

### Endpoints

- `GET /api/health`: Server health check
- `POST /api/query`: Process medical queries
- `POST /api/process-file`: Analyze medical files
- `GET /api/device-info`: Hardware acceleration info
- `GET /api/info`: API capabilities and status
### Sample Request

`POST /api/query`

```json
{
  "query": "What are the symptoms of type 2 diabetes?",
  "search_web": true
}
```
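Using only the standard library, a client for this endpoint might be built like so (illustrative; it assumes the JSON body shown above and the default port, and the response schema is not specified here):

```python
import json
from urllib import request

API_URL = "http://localhost:5000/api/query"  # default port from this README

def build_query_request(query, search_web=True, url=API_URL):
    """Build a urllib Request matching the sample body above."""
    body = json.dumps({"query": query, "search_web": search_web}).encode("utf-8")
    return request.Request(url, data=body,
                           headers={"Content-Type": "application/json"},
                           method="POST")

# With the server running:
#   with request.urlopen(build_query_request(
#           "What are the symptoms of type 2 diabetes?")) as resp:
#       print(json.load(resp))
```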
## Security

- Content validation and sanitization
- Trusted domain verification
- Input length restrictions
- Error handling and logging
- Medical disclaimer and usage limitations clearly stated
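For example, the input length restriction could be enforced along these lines (a minimal sketch; the limit and rules are illustrative, not the server's actual policy):

```python
MAX_QUERY_LENGTH = 2000  # hypothetical limit, not the server's real value

def validate_query(query):
    """Return a cleaned query string or raise ValueError."""
    if not isinstance(query, str) or not query.strip():
        raise ValueError("query must be a non-empty string")
    cleaned = query.strip()
    if len(cleaned) > MAX_QUERY_LENGTH:
        raise ValueError("query exceeds maximum length")
    return cleaned
```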
## Contributing

1. Fork the repository
2. Create a feature branch
3. Commit your changes
4. Push to the branch
5. Submit a pull request
## License

This project is licensed under the CCv1 License - see the LICENSE file for details.
## Acknowledgments

- Hugging Face for model hosting
- Trusted medical sources (NIH, CDC, Mayo Clinic, WHO, Reuters, etc.)
- Open-source medical research community