A robust, production-ready web interface for Large Language Models (LLMs) featuring a hybrid architecture with FastAPI backend and Streamlit frontend. Built for developers, researchers, and AI enthusiasts who need a comprehensive platform for LLM interaction, document processing, and API integration.
- **Hybrid Design** - Combines the best of both worlds:
  - **FastAPI Backend** (`localhost:8000`) - Entry point at `main.py`; a high-performance async API with comprehensive endpoints suited to asynchronous workloads
  - **Streamlit Frontend** (`localhost:8501`) - Entry point at `streamlit_app.py`; an intuitive web interface with automatic backend detection
  - **Modular Services** - `rag_service.py`, `ollama.py`, `file_ingest.py`, `enhanced_extractors.py`, and `enhanced_document_processor.py` provide specialized functionality
  - **Intelligent Fallback** - Seamlessly switches between FastAPI and local processing based on backend availability (see the sketch after this list)
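The fallback behaviour can be pictured with a short sketch (illustrative only, not the actual code in `streamlit_app.py`): ping the backend's `/health` endpoint and route requests through the FastAPI `/api/chat` endpoint when it responds, otherwise talk to the local Ollama HTTP API directly.

```python
import requests

FASTAPI_URL = "http://localhost:8000"   # FastAPI backend (main.py)
OLLAMA_URL = "http://localhost:11434"   # local Ollama server

def backend_available(timeout: float = 2.0) -> bool:
    """Return True when the FastAPI backend answers its /health endpoint."""
    try:
        return requests.get(f"{FASTAPI_URL}/health", timeout=timeout).ok
    except requests.RequestException:
        return False

def local_chat(message: str, model: str) -> str:
    """Fallback path: call the local Ollama HTTP API directly."""
    resp = requests.post(
        f"{OLLAMA_URL}/api/generate",
        json={"model": model, "prompt": message, "stream": False},
    )
    resp.raise_for_status()
    return resp.json()["response"]

def send_chat(message: str, model: str = "llama3.1") -> str:
    """Route through FastAPI when it is up, otherwise process locally."""
    if backend_available():
        resp = requests.post(
            f"{FASTAPI_URL}/api/chat",
            json={"message": message, "model": model},
        )
        resp.raise_for_status()
        return resp.json()["response"]
    return local_chat(message, model)

print(send_chat("Hello, world!"))
```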
- **High-Performance API** - Async FastAPI backend for scalable LLM processing
- **Dual Backend Support** - Seamlessly switch between Ollama (local) and vLLM (Hugging Face) backends
- **RAG Integration** - Upload documents (PDF, DOCX, TXT) with enhanced extraction and query them with context-aware responses
- **Auto-Failover** - Intelligent backend detection with graceful fallbacks
- **Multi-Model Support** - Access to popular models through vLLM or local Ollama models
- **Auto-Generated API Docs** - Interactive Swagger UI at `/docs`
- **RESTful Endpoints** - Complete API for chat, RAG, and model management
- **Pure Python Stack** - Modular Python files for RAG and LLM interaction; easy to extend, customize, and deploy
- **Dependency Management** - Reproducible installs with `uv`
- **Local-First** - Runs entirely on localhost, no external dependencies
- **CORS Configured** - Proper cross-origin resource sharing setup
- **Health Monitoring** - Built-in health checks and status monitoring
- **Streaming Support** - Real-time response streaming capabilities
The instructions below are tested against this repository (https://github.com/debabratamishra/litemind-ui) and the Docker images pushed to Docker Hub under the user debabratamishra1 (https://hub.docker.com/u/debabratamishra1).
One-line installer (downloads pre-built Docker images and starts services):
Note: Docker deployment currently supports the Ollama backend only. vLLM backend support will be added in a future release.
```bash
curl -fsSL https://raw.githubusercontent.com/debabratamishra/litemind-ui/main/install.sh | bash
```
What this does:
- Downloads and starts pre-built Docker images from Docker Hub (user: `debabratamishra1`)
- Writes basic configuration files if missing
- Starts the frontend and backend services using docker-compose
If you prefer to inspect the compose file before starting, see Option 1 (manual) below.
Manual Docker Hub Installation
```bash
# Download the production compose file
curl -O https://raw.githubusercontent.com/debabratamishra/litemind-ui/main/docker-compose.hub.yml

# Create required directories (only needed once)
mkdir -p uploads chroma_db storage .streamlit logs

# Start services with the provided compose file
docker-compose -f docker-compose.hub.yml up -d
```
Available Docker images (hosted on Docker Hub under `debabratamishra1`):
- Backend: https://hub.docker.com/r/debabratamishra1/litemindui-backend
- Frontend: https://hub.docker.com/r/debabratamishra1/litemindui-frontend
Quick start (build and run locally with Docker):
Note: Docker deployment currently supports the Ollama backend only. vLLM backend support will be added in a future release.
1. **Clone the repository**
```bash
git clone https://github.com/debabratamishra/litemind-ui
cd litemind-ui
```
2. **Setup Docker environment**
```bash
make setup
# or manually: ./scripts/docker-setup.sh
```
3. **Start the application**
```bash
make up
# or: docker-compose up -d
```
4. **Access the application** (a quick programmatic check is shown below)
- Frontend (Streamlit): http://localhost:8501
- Backend API: http://localhost:8000
- API Documentation: http://localhost:8000/docs
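Once the containers are up, you can verify the backend programmatically as well. A minimal check against the documented `/health` and `/models` endpoints (the exact response shapes may differ, so treat the printed output as informational):

```python
import requests

BASE = "http://localhost:8000"

# Backend health check (see the API endpoint table further below)
health = requests.get(f"{BASE}/health", timeout=5)
print("health:", health.status_code, health.text)

# List the Ollama models the backend can see
models = requests.get(f"{BASE}/models", timeout=5)
print("models:", models.json())
```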
Prerequisites for Docker:
- Docker and Docker Compose installed
- Ollama running on the host system (if you plan to use local Ollama models) at `http://localhost:11434`
- At least 4GB RAM (8GB+ recommended)

See `DOCKER.md` for advanced configuration and troubleshooting.
Make commands for Docker Hub images (already provided in the Makefile):
```bash
make hub-up    # Start with Docker Hub images
make hub-down  # Stop Docker Hub services
make version   # Show version management options
```
Use this if you prefer to run services locally without Docker. These instructions assume Python 3.12+ and a virtual environment.
1. **Clone the repository**
```bash
git clone https://github.com/debabratamishra/litemind-ui
cd litemind-ui
```
2. **Create and activate a virtual environment, then install dependencies**
```bash
python -m venv .venv
source .venv/bin/activate
python -m pip install --upgrade pip
python -m pip install -r requirements.txt
```
3. **Create required directories**
```bash
mkdir -p uploads .streamlit
```
Environment variables you may want to set (examples):
```bash
export OLLAMA_BASE_URL="http://localhost:11434"
export UPLOAD_FOLDER="./uploads"
```
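How the application consumes these variables depends on its own config code; as a rough sketch of the typical pattern (the default values shown are assumptions, not necessarily the app's own):

```python
import os

# Fall back to localhost defaults when the variables are unset.
OLLAMA_BASE_URL = os.environ.get("OLLAMA_BASE_URL", "http://localhost:11434")
UPLOAD_FOLDER = os.environ.get("UPLOAD_FOLDER", "./uploads")

os.makedirs(UPLOAD_FOLDER, exist_ok=True)  # ensure the upload directory exists
print(f"Using Ollama at {OLLAMA_BASE_URL}, storing uploads in {UPLOAD_FOLDER}")
```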
Notes:
- Running the full stack natively requires additional setup (Ollama, model files, vLLM/GPU drivers) and is intended for development.
- For most users, the Docker-based Quick Install is the simplest way to get started.
Quick notes for different host platforms:
- macOS (Apple Silicon / M1/M2): Docker will run amd64 images under emulation, which can be slower. The installer now auto-sets DOCKER_DEFAULT_PLATFORM=linux/amd64 for arm64 hosts. If you prefer, set it manually before running the installer:
```bash
export DOCKER_DEFAULT_PLATFORM=linux/amd64
curl -fsSL https://raw.githubusercontent.com/debabratamishra/litemind-ui/main/install.sh | bash
```
- macOS (Intel) and Linux (Ubuntu): The quick install should work as-is, provided Docker and docker-compose (or the Docker Compose CLI plugin) are installed.
- Windows: Run the installer inside WSL2 (recommended) or Git Bash; plain PowerShell/cmd doesn't provide bash by default. Example using WSL2:
```bash
wsl
# inside the WSL shell
curl -fsSL https://raw.githubusercontent.com/debabratamishra/litemind-ui/main/install.sh | bash
```
If you run into platform/architecture errors during image pull, try pulling manually and inspecting logs:
```bash
docker-compose -f docker-compose.hub.yml pull
docker-compose -f docker-compose.hub.yml up -d
docker-compose -f docker-compose.hub.yml logs -f
```
If you prefer `uv` for dependency management, install the requirements and create the required directories with:
```bash
uv pip install -r requirements.txt
mkdir -p uploads .streamlit
```
- The `UPLOAD_FOLDER` location can be customized via environment variables.
| Endpoint | Method | Description |
|---|---|---|
| `/health` | GET | Backend health check |
| `/models` | GET | Available Ollama models |
| `/api/chat` | POST | Process chat messages (supports both Ollama and vLLM backends) |
| `/api/chat/stream` | POST | Streaming chat responses (supports both backends) |
| `/api/rag/upload` | POST | Upload documents for RAG processing |
| `/api/rag/query` | POST | Query uploaded documents with context-aware responses |
| `/api/rag/documents` | GET | List uploaded documents |
| `/api/vllm/models` | GET | Available vLLM models and configuration |
| `/api/vllm/set-token` | POST | Configure Hugging Face access token |
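The RAG endpoints can be exercised directly with `requests`. A hedged sketch follows: the multipart field name (`file`), the JSON body shape (`query`), and the file name are assumptions, so confirm the exact schemas in the Swagger UI at `/docs`.

```python
import requests

BASE = "http://localhost:8000"

# Upload a document for RAG processing.
# "report.pdf" is a hypothetical example file; the multipart field name
# ("file") is an assumption -- verify it in /docs.
with open("report.pdf", "rb") as f:
    upload = requests.post(f"{BASE}/api/rag/upload", files={"file": f})
print(upload.status_code, upload.json())

# Query the uploaded documents with a natural-language question.
# The request body shape is likewise an assumption -- verify it in /docs.
query = requests.post(
    f"{BASE}/api/rag/query",
    json={"query": "What are the key findings?"},
)
print(query.json())

# List documents currently indexed for RAG
docs = requests.get(f"{BASE}/api/rag/documents")
print(docs.json())
```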
- Navigate to the Chat tab
- Select Backend: Choose between Ollama (local) or vLLM (Hugging Face)
- Configure Models:
- For Ollama: Select from locally installed models
- For vLLM: Choose from popular models or enter custom model names
- Enter your message and receive AI responses
- Switch to the RAG tab
- Upload PDF, TXT, or DOCX files
- Choose Backend: RAG works with both Ollama and vLLM backends
- Query your documents with natural language
- Get contextually relevant answers
- Seamless Integration: Switch between backends without losing your current page
- Model Persistence: Backend-specific model selections are preserved
- Automatic Configuration: UI adapts based on selected backend capabilities
Easily interact with the LiteMindUI backend from your applications.
```python
import requests

response = requests.post(
    "http://localhost:8000/api/chat",
    json={"message": "Hello, world!", "model": "llama3.1"}
)
print(response.json()["response"])
```
```javascript
const fetch = require('node-fetch');

fetch('http://localhost:8000/api/chat', {
  method: 'POST',
  headers: { 'Content-Type': 'application/json' },
  body: JSON.stringify({ message: 'Hello, world!', model: 'llama3.1' })
})
  .then(res => res.json())
  .then(data => console.log(data.response))
  .catch(err => console.error(err));
```
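The streaming endpoint can be consumed incrementally as well. A minimal Python sketch, assuming `/api/chat/stream` emits plain text chunks (the exact framing, e.g. SSE versus raw chunks, is described in the Swagger UI):

```python
import requests

# Open a streaming connection and print tokens as they arrive.
with requests.post(
    "http://localhost:8000/api/chat/stream",
    json={"message": "Tell me a short story", "model": "llama3.1"},
    stream=True,  # keep the connection open and read the body incrementally
) as resp:
    resp.raise_for_status()
    for chunk in resp.iter_content(chunk_size=None, decode_unicode=True):
        if chunk:
            print(chunk, end="", flush=True)
```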
For a complete list of endpoints and request/response formats, visit the Swagger UI at http://localhost:8000/docs.
Create `.streamlit/config.toml`:
```toml
[server]
address = "localhost"
port = 8501
```
```bash
export OLLAMA_BASE_URL="http://localhost:11434"
export UPLOAD_FOLDER="./uploads"
```
- Backend Detection: Automatic FastAPI availability checking with local fallback
- Dynamic Models: Real-time model list fetching from the Ollama backend (see the sketch after this list)
- Streaming Responses: Real-time token streaming for better UX
- Document Processing: Multi-format documents are ingested and vectorized at upload time for faster retrieval
- Error Handling: Comprehensive error handling with user-friendly messages
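As an illustration of the backend-detection and dynamic-model behaviour, here is a minimal Streamlit sketch (not the code of `streamlit_app.py` itself) that fetches the model list from the documented `/models` endpoint and falls back to a static default when the backend is unreachable. The response shape handled below is an assumption.

```python
import requests
import streamlit as st

FASTAPI_URL = "http://localhost:8000"

def fetch_models() -> list[str]:
    """Ask the backend for available Ollama models; fall back to a default list."""
    try:
        resp = requests.get(f"{FASTAPI_URL}/models", timeout=2)
        resp.raise_for_status()
        data = resp.json()
        # Accept either a bare list or an object wrapping it under "models"
        # (the exact shape is an assumption -- check /docs).
        return data if isinstance(data, list) else data.get("models", [])
    except requests.RequestException:
        return ["llama3.1"]  # local fallback when the backend is down

st.title("LiteMindUI - model picker sketch")
model = st.selectbox("Model", fetch_models())
st.write(f"Selected model: {model}")
```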
Docker Deployment Issues:
- Ollama not accessible: Ensure Ollama is running with `ollama serve`
- Permission errors: Run `chmod 755 ~/.cache/huggingface ~/.ollama`
- Port conflicts: Check with `lsof -i :8000 :8501` and kill conflicting processes
- Container build fails: Clean with `make clean && make setup && make up`
Backend Issues:
- vLLM backend not working: Verify the Hugging Face token is valid and the model exists (a token-setting sketch follows this list)
- Backend switching problems: Clear browser cache and reload the page
- Model loading errors: Check model compatibility and available GPU memory
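If the token needs to be (re)configured programmatically, the `/api/vllm/set-token` endpoint from the table above can be used. The JSON field name below is an assumption, so confirm it in `/docs`:

```python
import os
import requests

# Read the Hugging Face token from the environment rather than hard-coding it.
hf_token = os.environ["HF_TOKEN"]  # assumes you exported HF_TOKEN beforehand

resp = requests.post(
    "http://localhost:8000/api/vllm/set-token",
    json={"token": hf_token},  # field name "token" is an assumption -- check /docs
)
print(resp.status_code, resp.json())
```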
Native Installation Issues:
- Module not found: Reinstall dependencies with `uv pip install -r requirements.txt`
- Streamlit not starting: Check if port 8501 is available
- FastAPI errors: Verify Python 3.12+ and check logs in terminal
General Issues:
- Models not loading: Verify Ollama is running and models are pulled
- Upload failures: Check `uploads` directory permissions
- RAG not working: Ensure documents are uploaded and processed successfully
For comprehensive troubleshooting guides:
- Docker issues: DOCKER.md
- Health checks: DOCKER_HEALTH_CHECKS.md
- Fork the repository
- Create a feature branch: `git checkout -b feature-name`
- Commit changes: `git commit -am 'Add feature'`
- Push to branch: `git push origin feature-name`
- Submit a Pull Request