A proof-of-concept Retrieval-Augmented Generation (RAG) system that demonstrates:
- Document ingestion from multiple formats (PDF, HTML, TXT, MD, DOCX)
- Text chunking and embedding generation
- Local vector storage using ChromaDB
- Query API with FastAPI for document search and chat functionality
- Multi-format Document Support: Process PDF, HTML, TXT, Markdown, and DOCX files
- Flexible Embeddings: Support for both Sentence Transformers (local) and OpenAI embeddings
- Local Vector Database: ChromaDB for efficient similarity search
- RESTful API: FastAPI-based endpoints for document ingestion, search, and chat
- Docker Support: Fully containerized for easy deployment
- Test Coverage: Comprehensive test suite for core functionality
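The chunking step splits each document into fixed-size windows with overlap so that context is not lost at chunk boundaries. A minimal sketch of the idea (the `chunk_text` helper is illustrative, not the project's actual splitter; the defaults mirror the `CHUNK_SIZE`/`CHUNK_OVERLAP` settings in the configuration section):

```python
def chunk_text(text: str, chunk_size: int = 1000, chunk_overlap: int = 200) -> list[str]:
    """Split text into fixed-size chunks where neighbours share `chunk_overlap` characters."""
    if chunk_overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk size")
    step = chunk_size - chunk_overlap  # how far each window advances
    chunks = []
    for start in range(0, len(text), step):
        chunk = text[start:start + chunk_size]
        if chunk:
            chunks.append(chunk)
    return chunks

pieces = chunk_text("abcdefghij" * 300, chunk_size=1000, chunk_overlap=200)
# consecutive chunks share a 200-character overlap region
```

Real splitters (e.g. recursive character splitters) also try to break on sentence or paragraph boundaries, but the window-plus-overlap arithmetic is the same.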
- Clone this repository
- Replace the documents in `data/documents/` with your team's documents (PDF, DOCX, TXT, MD, HTML)
- Configure the environment by copying `.env.example` to `.env` and adding your OpenAI API key (optional)
- Run with Docker: `docker-compose up --build`
- Start querying your documents at `http://localhost:8000`

Your documents will be automatically processed and ready for search and chat!
- Docker and Docker Compose
- Python 3.12+ (for local development)
- OpenAI API key (optional, for chat functionality)
- Clone the repository:

  ```bash
  git clone https://github.com/yourusername/docker-rag-test.git
  cd docker-rag-test
  ```

- Create a `.env` file from the example:

  ```bash
  cp .env.example .env
  # Edit .env and add your OpenAI API key if you want chat functionality
  ```

- Add your documents to the `data/documents/` directory

- Build and run with Docker Compose:

  ```bash
  docker-compose up --build
  ```

The API will be available at `http://localhost:8000`. Documents in `data/documents/` will be automatically ingested on startup.
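A minimal `docker-compose.yml` for this kind of setup might look like the following sketch. The service name, volume names, and ChromaDB path are assumptions for illustration, not the project's actual configuration:

```yaml
services:
  rag-api:
    build: .
    ports:
      - "8000:8000"       # FastAPI served on localhost:8000
    env_file:
      - .env              # OPENAI_API_KEY, CHUNK_SIZE, etc.
    volumes:
      - ./data/documents:/app/data/documents   # auto-ingest directory
      - chroma-data:/app/chroma_db             # persist vectors across restarts

volumes:
  chroma-data:
```

Mounting the documents directory lets you swap in new files without rebuilding the image; the named volume keeps the ChromaDB index across container restarts.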
- Install dependencies:

  ```bash
  pip install -r requirements.txt
  ```

- Run the application:

  ```bash
  uvicorn src.api.main:app --reload
  ```

Check that the API is up:

```bash
curl http://localhost:8000/
```
Documents in `data/documents/` are automatically ingested on startup. To manually ingest additional documents:
```bash
curl -X POST http://localhost:8000/ingest \
  -H "Content-Type: application/json" \
  -d '{"directory_path": "/app/data/documents"}'
```

Upload individual files:

```bash
curl -X POST http://localhost:8000/upload \
  -F "files=@document1.pdf" \
  -F "files=@document2.txt"
```

Search the indexed documents:

```bash
curl -X POST http://localhost:8000/query \
  -H "Content-Type: application/json" \
  -d '{
    "query": "machine learning",
    "k": 5
  }'
```

Chat using retrieved document context:

```bash
curl -X POST http://localhost:8000/chat \
  -H "Content-Type: application/json" \
  -d '{
    "message": "What is machine learning?",
    "k": 3,
    "use_context": true
  }'
```

Count the stored document chunks:

```bash
curl http://localhost:8000/documents/count
```

Delete all stored documents:

```bash
curl -X DELETE http://localhost:8000/documents
```
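The same `/query` and `/chat` calls can be made from Python with only the standard library. The payload fields below mirror the curl examples above; the `build_request` helper is illustrative, not part of the project:

```python
import json
import urllib.request

BASE_URL = "http://localhost:8000"

def build_request(endpoint: str, payload: dict) -> urllib.request.Request:
    """Build a JSON POST request matching the API's curl examples."""
    return urllib.request.Request(
        f"{BASE_URL}{endpoint}",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

query_req = build_request("/query", {"query": "machine learning", "k": 5})
chat_req = build_request(
    "/chat",
    {"message": "What is machine learning?", "k": 3, "use_context": True},
)
# With the server running, send with: urllib.request.urlopen(query_req)
```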
```
docker-rag-test/
├── src/
│   ├── ingestion/         # Document loading and text splitting
│   ├── embedding/         # Embedding generation (OpenAI/Sentence Transformers)
│   ├── storage/           # Vector database interface (ChromaDB)
│   └── api/               # FastAPI application and endpoints
├── tests/                 # Test suite
├── data/
│   └── documents/         # Place your documents here for auto-ingestion
├── requirements.txt       # Python dependencies
├── docker-compose.yml     # Docker Compose configuration
└── .env.example           # Environment variables template
```
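The layers above compose into a simple pipeline: ingestion produces chunks, the embedder turns them into vectors, and the store indexes them for retrieval. A toy sketch of how the interfaces fit together (all class names here are illustrative stand-ins, not the real module APIs):

```python
from typing import Protocol

class Embedder(Protocol):
    """Interface both embedder backends satisfy."""
    def embed(self, texts: list[str]) -> list[list[float]]: ...

class ToyEmbedder:
    """Stand-in for a real SentenceTransformer/OpenAI embedder."""
    def embed(self, texts: list[str]) -> list[list[float]]:
        # Trivial 4-dimensional "embedding" for demonstration only
        return [[float(ord(c)) for c in t[:4]] for t in texts]

class InMemoryStore:
    """Stand-in for the ChromaDB-backed vector store."""
    def __init__(self, embedder: Embedder) -> None:
        self.embedder = embedder
        self.docs: list[tuple[str, list[float]]] = []

    def add(self, chunks: list[str]) -> None:
        for chunk, vec in zip(chunks, self.embedder.embed(chunks)):
            self.docs.append((chunk, vec))

    def count(self) -> int:
        return len(self.docs)

store = InMemoryStore(ToyEmbedder())
store.add(["chunk one", "chunk two"])
```

Because the store only depends on the `Embedder` protocol, swapping the local Sentence Transformers backend for OpenAI embeddings is a configuration change, not a code change.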
Environment variables can be set in the `.env` file:

- `OPENAI_API_KEY`: Your OpenAI API key (required for chat functionality)
- `CHUNK_SIZE`: Size of text chunks (default: 1000)
- `CHUNK_OVERLAP`: Overlap between chunks (default: 200)
- `EMBEDDER_TYPE`: "sentence-transformer" or "openai" (default: "sentence-transformer")
- `CHROMA_PERSIST_DIRECTORY`: Directory for ChromaDB persistence
- `AUTO_INGEST_ON_STARTUP`: Enable/disable auto-ingestion on startup (default: true)
- `AUTO_INGEST_DIRECTORY`: Directory to auto-ingest documents from (default: /app/data/documents)
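Inside the application, these variables are typically resolved once at startup with defaults matching the list above. A hedged sketch (the `Settings` class and `load_settings` function are hypothetical, and the ChromaDB directory default is an assumption since the README does not state one):

```python
import os
from dataclasses import dataclass
from typing import Optional

@dataclass
class Settings:
    """Application settings resolved from environment variables."""
    openai_api_key: Optional[str]
    chunk_size: int
    chunk_overlap: int
    embedder_type: str
    chroma_persist_directory: str
    auto_ingest_on_startup: bool
    auto_ingest_directory: str

def load_settings() -> Settings:
    return Settings(
        openai_api_key=os.getenv("OPENAI_API_KEY"),  # None -> chat disabled
        chunk_size=int(os.getenv("CHUNK_SIZE", "1000")),
        chunk_overlap=int(os.getenv("CHUNK_OVERLAP", "200")),
        embedder_type=os.getenv("EMBEDDER_TYPE", "sentence-transformer"),
        chroma_persist_directory=os.getenv("CHROMA_PERSIST_DIRECTORY", "./chroma_db"),  # assumed default
        auto_ingest_on_startup=os.getenv("AUTO_INGEST_ON_STARTUP", "true").lower() == "true",
        auto_ingest_directory=os.getenv("AUTO_INGEST_DIRECTORY", "/app/data/documents"),
    )

settings = load_settings()
```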
Run the test suite:

```bash
pytest
```

Run with coverage:

```bash
pytest --cov=src tests/
```
The project follows these principles:
- Test-Driven Development: Tests are written for core functionality
- Modular Design: Clear separation between ingestion, embedding, storage, and API layers
- Docker-First: Fully containerized for consistent environments
- Type Safety: Uses Pydantic for data validation
- Async Support: FastAPI with async endpoints for better performance
- Fix SentenceTransformerEmbedder api_key parameter error in rag_service.py
- Test Streamlit frontend functionality at http://localhost:8501
- Verify that documents in data/documents/ are being ingested correctly
- Create .env.example file with proper template variables
While this is a proof-of-concept with local storage, the architecture supports easy migration to:
- Cloud vector databases (AWS S3 Vector Engine, Pinecone, Qdrant)
- Serverless deployment (AWS Lambda)
- Container orchestration (AWS ECS/Fargate)
- Managed API Gateway integration
This project was built following these principles:
- Use a widely supported, compatible tech stack
- Test-driven development
- Explicit folder structure separating resources from code
- Docker-first approach for local development
- Design for easy cloud migration
- Follow best practices
- Version control with regular commits