VerseMind-RAG

Where Poetry Meets AI

VerseMind-RAG is a document intelligence and semantic generation system designed to provide a complete RAG (Retrieval-Augmented Generation) solution. It supports document loading, chunking, parsing, vector embedding, indexing, semantic search, and text generation.

Features

Document Processing: Load, chunk, and parse documents for structured content.
Vector Embedding: Generate embeddings for semantic search.
Semantic Search: Retrieve relevant content based on user queries.
Text Generation: Generate responses using advanced AI models.
Modular Design: Easily extensible with support for multiple models and vector databases.
Dynamic Model Configuration: Centralized model management system for consistent model selection across components.

System Architecture

Frontend: React + Vite + TailwindCSS
Backend: FastAPI + Python services
Storage: File system + FAISS/Chroma vector database
Models: Ollama (local) + DeepSeek/OpenAI APIs

Quick Start

Prerequisites

Node.js 18+ (for frontend)
Python 3.12.9 (for backend, recommended to use Conda)
Conda (or Miniconda/Anaconda)
SWIG (needed for Faiss, can be installed via Conda: conda install -c anaconda swig)
Ollama (optional, for local model support)

Installation

Clone the repository:

git clone https://github.com/topgoer/VerseMind-RAG.git
cd VerseMind-RAG

Set up Conda environment:

# Create and activate the environment
conda create -n versemind-rag python=3.12.9
conda activate versemind-rag

Install frontend dependencies:
```
cd frontend
npm install
```

Install backend dependencies (choose one method):

Option A: Using pip with requirements.txt

cd backend
pip install -r requirements.txt

Option B: Using Poetry

cd backend
# Install Poetry if not already installed
# pip install poetry
poetry install

Option C: Using uv (faster Python package installer)

cd backend
# Install uv if not already installed
# pip install uv
uv pip install -r requirements.txt

Configure environment variables:

# Create a config.toml file based on the example
cp config/config.example.toml config/config.toml
# Edit the config.toml file to set your API keys and preferences

Running the Project

Start backend:

# Using the convenience script (Windows):
start-backend.bat

# Using the convenience script (macOS/Linux):
chmod +x ./start-backend.sh  # Make executable (first time only)
./start-backend.sh

# OR manually:
cd backend
conda activate versemind-rag  # Or your virtual environment
uvicorn app.main:app --host 0.0.0.0 --port 8200 --reload

The backend will be available at:

API endpoint: http://localhost:8200
API documentation: http://localhost:8200/docs

Start frontend:

# Development mode with hot reload:
cd frontend
npm run dev

# OR using convenience script (Windows):
start-frontend.bat dev

# OR using convenience script (macOS/Linux):
chmod +x ./start-frontend.sh  # Make executable (first time only)
./start-frontend.sh dev

# Production mode:
cd frontend
npm run build
npm run preview

# OR using convenience script (Windows):
start-frontend.bat

# OR using convenience script (macOS/Linux):
chmod +x ./start-frontend.sh  # Make executable (first time only)
./start-frontend.sh

Access the application:
- Development mode: http://localhost:3200
- Production preview: http://localhost:4173

Using Local Models with Ollama

VerseMind-RAG can utilize local AI models through Ollama for both text generation and embedding, which:

Keeps your data private by processing it locally
Eliminates API costs and rate limits
Provides offline functionality

Setup Steps:

Install Ollama from ollama.ai

Pull the models you want to use:

# Pull embedding models (choose at least one)
ollama pull bge-m3        # Smaller, faster embedding model
ollama pull bge-large     # Larger, more accurate embedding model

# Pull generation models (choose models that fit your hardware)
ollama pull gemma3:4b     # Smaller, faster model (4B parameters)
ollama pull deepseek-r1:14b # Better quality, needs more RAM (14B parameters)
ollama pull phi4          # Microsoft's latest model
ollama pull mistral       # Good performance/resource balance
ollama pull llama3.2-vision # For image understanding capabilities

Ensure Ollama is running in the background before starting VerseMind-RAG
In the VerseMind-RAG interface, select your preferred models from the Models dropdown menu

Environment Configuration

The application is configured through config/config.toml. You need to copy and modify the example configuration file:

# Copy the example configuration
cp config/config.example.toml config/config.toml

# Edit the config.toml file according to your needs
# For Ollama users, the default settings should work out of the box
# For API-based models (OpenAI, DeepSeek), you'll need to add your API keys

Basic configuration options:

# Main LLM configuration - selects your default model
[llm]
api_type = "ollama"  # Use "ollama" for local models, "openai" for OpenAI/DeepSeek APIs
model = "gemma3:4b"  # Your preferred model
base_url = "http://localhost:11434"  # Ollama API URL (default)

# Server settings
[server]
host = "0.0.0.0"
port = 8200

# Vector database settings
[vector_store]
type = "faiss"  # Options: "faiss", "chroma"
persist_directory = "./storage/vector_db"

See the config/config.example.toml file for more configuration options.

Advanced Usage

Custom Model Configuration

VerseMind-RAG allows you to customize models through the config/config.toml file. You can configure:

Multiple embedding models with different dimensions and providers
Custom generation models with specific parameters (temperature, max_tokens, etc.)
Provider-specific configurations for different AI services

See the model configuration section in config/config.example.toml for examples.

Vector Database Optimization

VerseMind-RAG supports multiple vector database backends for storing and retrieving document embeddings:

FAISS: Optimized for high-performance vector similarity search
- Excellent for large-scale datasets (millions of vectors)
- Multiple index types (Flat, HNSW32, HNSW64)
- Various distance metrics (cosine, L2, inner product)
Chroma: Feature-rich vector database with metadata filtering
- Rich metadata filtering capabilities
- Easy-to-use API and simple configuration
- Excellent for complex document retrieval requirements

Configuration is managed through the config/config.toml file using this structure:

[vector_store]
type = "faiss"  # Options: "faiss", "chroma"
persist_directory = "./storage/vector_db"

[vector_store.faiss]
index_type = "HNSW32"  # Options: "Flat", "HNSW32", "HNSW64"
metric = "cosine"      # Options: "cosine", "l2", "ip"

[vector_store.chroma]
collection_name = "versemind_docs"
distance_function = "cosine"  # Options: "cosine", "l2", "ip"

For detailed configuration options including index types, distance metrics, and performance tuning recommendations, please refer to the User Guide.

Logging Configuration

VerseMind-RAG supports dynamic logging level configuration through environment variables, allowing you to control the verbosity of logs without modifying the code:

# In your .env file:
LOG_LEVEL=INFO  # Options: DEBUG, INFO, WARNING, ERROR, CRITICAL

Available log levels:

DEBUG: Show all messages, including detailed debugging information
INFO: Show informational messages, warnings, and errors (default)
WARNING: Show only warnings and errors
ERROR: Show only errors
CRITICAL: Show only critical errors

For more details, see Logging Configuration.

Advanced Document Processing

VerseMind-RAG provides flexible document processing configurations that can be adjusted to optimize for different document types and use cases.

For detailed configuration options, please refer to the configuration example file (config/config.example.toml).

How to Use VerseMind-RAG

VerseMind-RAG provides a complete document processing and retrieval pipeline:

Upload Documents via the UI
Process Documents (chunking, parsing, embedding, indexing)
Query Documents through the Chat interface
View Results with source references and visualizations

For detailed usage instructions, sample queries, and best practices, please refer to the User Guide.

Documentation

For detailed documentation, refer to:

Contributing

Contributions to VerseMind-RAG are welcome! Here's how you can contribute:

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add some amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

Please ensure your code passes all tests before submitting a pull request.

Contributing

Contributions to VerseMind-RAG are welcome! Here's how you can contribute:

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add some amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

Please ensure your code passes all tests before submitting a pull request.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
.github/workflows		.github/workflows
backend		backend
config		config
docs		docs
frontend		frontend
milvus-docker		milvus-docker
my-n8n		my-n8n
.env.example		.env.example
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
README.md		README.md
README_CN.md		README_CN.md
debug_toml_structure.py		debug_toml_structure.py
debug_vision_fixed.py		debug_vision_fixed.py
docker-compose.yml		docker-compose.yml
migrate-all-docs.ps1		migrate-all-docs.ps1
pyproject.toml		pyproject.toml
start-backend.bat		start-backend.bat
start-backend.sh		start-backend.sh
start-frontend.bat		start-frontend.bat
start-frontend.sh		start-frontend.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

VerseMind-RAG

Features

System Architecture

Quick Start

Prerequisites

Installation

Running the Project

Using Local Models with Ollama

Environment Configuration

Advanced Usage

Custom Model Configuration

Vector Database Optimization

Logging Configuration

Advanced Document Processing

How to Use VerseMind-RAG

Documentation

Contributing

Contributing

License

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Languages

topgoer/VerseMind-RAG

Folders and files

Latest commit

History

Repository files navigation

VerseMind-RAG

Features

System Architecture

Quick Start

Prerequisites

Installation

Running the Project

Using Local Models with Ollama

Environment Configuration

Advanced Usage

Custom Model Configuration

Vector Database Optimization

Logging Configuration

Advanced Document Processing

How to Use VerseMind-RAG

Documentation

Contributing

Contributing

License

Acknowledgements

About

Resources

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages