🧠 DocuMind - AI-Powered Knowledge Base Assistant


🚀 Overview

DocuMind is a privacy-focused, self-hosted AI assistant that helps you extract insights from your PDF documents using local LLMs through Ollama. Ask questions about your documents in natural language and receive accurate answers with source citations - all without sending your data to external services.

📄 Upload Documents → 🔍 Ask Questions → 🤖 Get AI Answers

✨ Key Features

  • 🔒 Privacy First: All processing happens locally, with no external API calls
  • 📄 Multi-format PDF Processing: Robust text extraction with OCR support
  • 🔍 Hybrid Retrieval System: Combines semantic and keyword search for accuracy (sketched below)
  • 🤖 Local LLM Integration: Uses Ollama (Llama 3.2 3B) for responses
  • 💬 Conversation Memory: Maintains context across multiple questions
  • 📊 Source Attribution: Shows which documents informed each answer
  • 🔄 Automatic Document Loading: Auto-loads PDFs from the documents directory
  • 🌐 Dual Interfaces: Both a Streamlit UI and an HTML/CSS/JS web interface
  • ⚡ Docker Ready: Simple setup with Docker and GPU acceleration support
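
To make the hybrid retrieval idea concrete, here is a minimal, hypothetical sketch that blends embedding similarity with BM25 keyword scores. The rank_bm25 package is an assumption for illustration only; the project's actual logic lives in src/retriever.py and may differ.

# Hypothetical hybrid-retrieval sketch: blend semantic similarity with
# BM25 keyword scores. The real implementation is in src/retriever.py.
from rank_bm25 import BM25Okapi
from sentence_transformers import SentenceTransformer, util

chunks = [
    "DocuMind processes PDF documents locally.",
    "Ollama serves the Llama 3.2 3B model for answers.",
    "ChromaDB stores the document embeddings.",
]

model = SentenceTransformer("all-MiniLM-L6-v2")
chunk_embeddings = model.encode(chunks, convert_to_tensor=True)
bm25 = BM25Okapi([c.lower().split() for c in chunks])

def hybrid_search(query, alpha=0.5):
    """Rank chunks by a weighted mix of semantic and BM25 keyword scores.
    Scores are not normalized in this sketch; a real system would rescale
    both signals before mixing."""
    semantic = util.cos_sim(model.encode(query, convert_to_tensor=True),
                            chunk_embeddings)[0]
    keyword = bm25.get_scores(query.lower().split())
    mixed = [(alpha * float(s) + (1 - alpha) * float(k), c)
             for s, k, c in zip(semantic, keyword, chunks)]
    return sorted(mixed, reverse=True)

print(hybrid_search("Which model generates the answers?"))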

🚀 Quick Start with Docker (Recommended)

The easiest way to get started is using the included Docker helper script:

# Make the script executable (if needed)
chmod +x run_docker.sh

# Run the script and follow the menu options
./run_docker.sh

Select option 1 from the menu to start DocuMind, then open the web interface in your browser.

💻 Manual Setup (Alternative)

If you prefer to run without Docker:

  1. Install Dependencies

    pip install -r requirements.txt
  2. Install Ollama

    # Follow Ollama installation instructions from https://ollama.ai/
    # Run the Ollama service
    ollama serve
  3. Install OCR Dependencies (Optional)

    pip install pytesseract pdf2image pillow
    brew install tesseract poppler  # For macOS
    # See documentation/OCR_SETUP.md for other OS instructions
  4. Add Documents

    • Place PDF files in the data/documents directory
  5. Run the Application

    # Start the Web Interface
    python api.py
    
    # OR start the Streamlit Interface
    streamlit run app.py

🔧 System Architecture

DocuMind/
├── app.py                     # Main Streamlit application
├── api.py                     # Alternative web interface (HTML/CSS/JavaScript)
├── docker-entrypoint.sh       # Docker container startup script
├── docker-compose.yml         # Container orchestration configuration
├── docker-compose.gpu.yml     # GPU support configuration
├── Dockerfile                 # Container definition
├── run_docker.sh              # Docker helper script
├── src/
│   ├── document_processor.py  # PDF processing and extraction with OCR
│   ├── chunking.py            # Semantic text chunking
│   ├── retriever.py           # Hybrid retrieval system
│   ├── llm_handler.py         # LLM integration and prompts
│   ├── evaluator.py           # Evaluation framework
│   ├── preload_models.py      # Model preloading script
│   └── utils.py               # Utility functions
├── data/
│   ├── documents/             # PDF documents for auto-loading
│   ├── vectorstore/           # Chroma vector database
│   ├── models_cache/          # Hugging Face model cache
│   └── chroma_cache/          # ChromaDB ONNX model cache
├── config/
│   └── settings.py            # Configuration settings
├── documentation/             # Detailed documentation files
├── tests/                     # Testing and diagnostic tools
└── web/                       # Web UI assets (HTML/CSS/JS)

📊 Performance Optimization

Embedding Model Caching

DocuMind pre-downloads and caches embedding models to improve startup and query time:

  • Models are stored in ./data/models_cache/
  • ONNX optimized versions are kept in ./data/chroma_cache/onnx_models/
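
If you need to warm the cache manually, a minimal sketch looks like this; the project ships its own preloader in src/preload_models.py.

# Minimal sketch of warming the embedding-model cache; the project's
# actual script is src/preload_models.py.
from sentence_transformers import SentenceTransformer

# cache_folder stores the weights under the project instead of the default
# Hugging Face cache, so later startups skip the download entirely.
SentenceTransformer("all-MiniLM-L6-v2", cache_folder="./data/models_cache")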

LLM Selection

Choose the right LLM based on your hardware:

  • High-end systems: Use the default llama3.2:3b or a larger model
  • Low-resource systems: Switch to phi3:mini for faster responses (option 5 in the run_docker.sh menu)
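
Outside the Docker menu, one way to fetch and smoke-test an alternative model is through the official ollama Python client; this assumes pip install ollama and a running Ollama service, and the run_docker.sh menu handles the same switch for Docker setups.

# Sketch: pull and smoke-test a smaller model via the ollama Python client.
# Assumes `pip install ollama` and a running Ollama service.
import ollama

ollama.pull("phi3:mini")  # one-time download of the lighter model
reply = ollama.generate(model="phi3:mini", prompt="Reply with one word: ready?")
print(reply["response"])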

πŸ” Advanced Features

Auto-Loading Documents

  • Documents placed in the data/documents directory are automatically loaded when the app starts
  • Configure auto-loading behavior in config/settings.py:
    AUTO_LOAD_DOCUMENTS = True  # Enable/disable auto-loading
    AUTO_LOAD_SKIP_EXISTING = True  # Skip already processed documents
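
Conceptually, the startup pass behaves like the sketch below. The helper names are hypothetical stand-ins, not the project's actual functions.

# Hypothetical sketch of the auto-loading pass; the real settings live in
# config/settings.py and the helpers below are illustrative stand-ins.
from pathlib import Path

AUTO_LOAD_DOCUMENTS = True       # mirrors config/settings.py
AUTO_LOAD_SKIP_EXISTING = True   # mirrors config/settings.py

def is_processed(pdf):
    """Stand-in: the real app would check the vector store for this file."""
    return False

def process_document(pdf):
    """Stand-in: the real app extracts, chunks, and embeds the PDF."""
    print(f"Ingesting {pdf.name}")

def auto_load(docs_dir="data/documents"):
    """Scan the documents directory and ingest any PDFs found there."""
    if not AUTO_LOAD_DOCUMENTS:
        return
    for pdf in sorted(Path(docs_dir).glob("*.pdf")):
        if AUTO_LOAD_SKIP_EXISTING and is_processed(pdf):
            continue
        process_document(pdf)

auto_load()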

OCR Support for Problematic PDFs

  • OCR (Optical Character Recognition) processing for difficult PDFs
  • Automatically detects when a PDF needs OCR and applies it
  • Perfect for PDFs saved from websites that have selectable text but don't parse correctly
  • See OCR Setup Guide for detailed setup instructions
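
The OCR path boils down to rasterizing pages and running Tesseract over the images, roughly as in this sketch with pdf2image and pytesseract; the project's automatic detection in src/document_processor.py is more involved.

# Illustrative OCR fallback: rasterize each page, then OCR the images.
# Requires pytesseract, pdf2image, and the tesseract/poppler binaries.
from pdf2image import convert_from_path
import pytesseract

def ocr_pdf(path):
    pages = convert_from_path(path, dpi=300)  # render each page as a PIL image
    return "\n".join(pytesseract.image_to_string(page) for page in pages)

# The path below is a placeholder; point it at any stubborn PDF.
print(ocr_pdf("data/documents/example.pdf")[:500])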

PDF Diagnostic Tool

  • Use tests/check_pdf.py to diagnose problematic PDFs:
    python tests/check_pdf.py path/to/document.pdf
  • Identifies which extraction method works best for each document
  • Determines if OCR processing is recommended
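
As a rough picture of what such a diagnostic does, the sketch below runs each extraction backend on one PDF and compares how much text comes out. It is illustrative only; the real tool is tests/check_pdf.py and may work differently.

# Illustrative comparison of extraction backends on one PDF;
# the actual diagnostic is tests/check_pdf.py.
import sys

import fitz  # PyMuPDF
import pdfplumber
from PyPDF2 import PdfReader

def extracted_chars(path):
    """Count how many characters each backend can pull out of the PDF."""
    counts = {}
    counts["PyPDF2"] = sum(len(p.extract_text() or "")
                           for p in PdfReader(path).pages)
    with fitz.open(path) as doc:
        counts["PyMuPDF"] = sum(len(page.get_text()) for page in doc)
    with pdfplumber.open(path) as pdf:
        counts["pdfplumber"] = sum(len(p.extract_text() or "")
                                   for p in pdf.pages)
    return counts

counts = extracted_chars(sys.argv[1])
print(counts)
if max(counts.values()) < 100:  # rough heuristic, not the tool's actual rule
    print("Very little text found; OCR is probably needed.")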

🛠️ Troubleshooting Common Issues

1. Timeout Error During Query Processing

Symptom: Requests time out with the error "Error generating response: Read timed out."

Solution:

  • Switch to a smaller LLM through option 5 in the run_docker.sh script
  • Restart the containers to apply changes

2. Document Loading Issues

Symptom: Documents fail to load or extract properly

Solution:

  • Check the format of your PDF
  • Run diagnostic tool: python tests/check_pdf.py path/to/document.pdf
  • Enable OCR for problematic documents

3. Ollama Connection Issues

Symptom: Error connecting to Ollama service

Solution:

  • For Docker: Ensure the Ollama container is running (docker ps)
  • For manual setup: Make sure Ollama is running (ollama serve)
  • See Environment Setup Guide for details on connection configuration
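
A quick connectivity probe from Python, assuming Ollama's default port 11434; inside Docker, replace localhost with the Ollama service name.

# Quick connectivity check against Ollama's HTTP API (default port 11434).
import requests

try:
    r = requests.get("http://localhost:11434/api/tags", timeout=5)
    r.raise_for_status()
    models = [m["name"] for m in r.json().get("models", [])]
    print("Ollama is up; installed models:", models)
except requests.RequestException as exc:
    print("Cannot reach Ollama:", exc)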

For more troubleshooting tips, see the Full Documentation.

📚 Documentation

Comprehensive documentation is available in the documentation folder.

📋 Technology Stack

  • Document Processing: PyPDF2, PyMuPDF, pdfplumber, Tesseract OCR
  • Embeddings: Sentence-Transformers (all-MiniLM-L6-v2)
  • Vector Database: ChromaDB
  • LLM: Ollama (Llama 3.2 3B)
  • Frontend: Streamlit, HTML/CSS/JavaScript
  • Backend: Python FastAPI
  • Containers: Docker, Docker Compose

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.


Developed with ❤️ by Fakhrul Fauzi.
