ConfluAI: A Local RAG System with LLaMA 3.2, Hugging Face Embeddings, and FAISS

ConfluAI is a Retrieval-Augmented Generation (RAG) system built to run entirely locally. It leverages modern NLP technologies to answer questions and engage in continuous chat sessions using your project documentation (e.g., Confluence exports) as the knowledge base.

🔥 Why ConfluAI?

Modern teams are drowning in ever-growing documentation—Confluence pages, design specs, runbooks, and support guides. Finding the right answer can take minutes or even hours, slowing down development, support, and decision-making. ConfluAI transforms your sprawling Confluence knowledge base into a high-precision, citation-backed AI assistant that delivers the right answer—fast.

🚀 What You Get

Instant, Accurate Answers
State-of-the-art Hugging Face embeddings (behind the HFEmbedding adapter) and FAISS surface the most relevant snippets in milliseconds, while LLaMA 3.2 served by Ollama turns them into answers.

No-Hallucination Confidence
Our strict “no-hallucination” prompt template ensures the model only answers from your actual docs—otherwise it says “I don’t know.”
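
As a rough illustration, the template might look like this (a hedged sketch; the exact wording and variable names in the project's prompt template may differ):

    # Hypothetical prompt template; the actual RichPromptTemplate wording
    # in the project may differ.
    PROMPT_TEMPLATE = """SYSTEM: You are a documentation assistant. Answer ONLY
    from the context below. If the context does not contain the answer, say
    "I don't know." Do not invent information.

    CONTEXT:
    {context}

    QUESTION: {question}

    ANSWER:"""

    # Example values; in ConfluAI the context comes from FAISS retrieval.
    retrieved_context = "[auth.md | chunk 3 | score 0.82] To rotate a key..."
    prompt = PROMPT_TEMPLATE.format(
        context=retrieved_context,
        question="How do I rotate an API key?",
    )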

Transparent Citations
Every response comes with file paths and chunk-level scores, giving your team full auditability and trust.

Rich Context Handling
Intelligent pre-segmentation and metadata-aware chunking preserve headings, paragraphs, code blocks, and lists—so every answer is grounded in coherent, semantically meaningful sections.

Privacy & Control
Runs entirely on-prem or in your private cloud—no data ever leaves your environment.

Easy Integration
Out-of-the-box Python modules (scraper.py, indexer.py, main.py) slot into your CI/CD pipelines, Confluence export jobs, or scheduled ETL workflows.

🌟 Key Features

  • Automated Confluence Scraping: Keeps the index up to date with minimal manual effort
  • RecursiveCharacterTextSplitter: Fine-grained control over chunk size and overlap
  • FAISS Vector Store: Lightning-fast semantic search on millions of vectors
  • RichPromptTemplate: Structured “SYSTEM → CONTEXT → QUESTION → ANSWER” pipelines
  • Citation Map with Confidence: File and chunk IDs plus similarity scores for full transparency
  • Low-Latency Local LLM: Deploy locally with Ollama for sub-second inference
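
The RecursiveCharacterTextSplitter named above comes from LangChain; here is a minimal sketch of how it might be configured (the chunk sizes are illustrative assumptions, not the project's actual settings):

    # Illustrative chunking sketch; the sizes and import path are assumptions.
    from langchain_text_splitters import RecursiveCharacterTextSplitter

    splitter = RecursiveCharacterTextSplitter(
        chunk_size=512,       # characters per chunk; tune to your embedding model
        chunk_overlap=64,     # overlap keeps context across chunk boundaries
        separators=["\n\n", "\n", " ", ""],  # split on paragraphs before words
    )
    document_text = (
        "Overview\n\nConfluAI answers questions from your docs.\n\n"
        "Setup\n\nInstall the requirements, then run main.py."
    )
    chunks = splitter.split_text(document_text)
    print(len(chunks), chunks[0])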

Features

  • Local Processing: All components run on your local machine—no external API keys or cloud dependencies.
  • Web Scraping: A standalone web-scraper script (scrapper.py) retrieves Confluence content; it is not yet integrated into the main pipeline.
  • Embeddings: Uses Hugging Face Transformers (e.g., BAAI/bge-small-en-v1.5) to convert text into dense vector representations (see the sketch after this list).
  • Vector Retrieval: Utilizes FAISS for fast and efficient vector search.
  • LLM Integration: Queries a local LLaMA 3.2 model served via Ollama.
  • Context-aware Chat: Supports context-aware question queries.
  • Modular Design: Clear separation of concerns across multiple modules for ease of maintenance and extension.
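
For the Embeddings bullet above, a minimal sketch using the sentence-transformers package (an assumption; the project's embeddings.py wraps Hugging Face models and may load them differently):

    # Minimal embedding sketch; the project's HFEmbedding adapter may differ.
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer("BAAI/bge-small-en-v1.5")
    vectors = model.encode(
        ["How do I rotate an API key?"],
        normalize_embeddings=True,  # unit vectors, ready for cosine similarity
    )
    print(vectors.shape)  # (1, 384): matches EMBEDDING_DIM = 384 in config.py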

WIP Features

  • Integrated web scraper
  • Continuous conversations using a custom workflow

Repository Structure

   ConfluAi/
   ├── README.md              # This file – repository overview and instructions
   ├── base_urls.txt          # A text file with the list of base URLs for web scraping
   ├── config.py              # Global configuration file (paths, model names, etc.)
   ├── confluence_docs/       # Directory to store scraped and cleaned Confluence documents (as text files)
   ├── embeddings.py          # Module that implements the HFEmbedding adapter using Hugging Face Transformers
   ├── faiss_index/           # Directory where the FAISS vector store and associated JSON files are persisted
   ├── indexer.py             # Core module for building the FAISS index and querying it with LlamaIndex
   ├── llm_adapter.py         # LLM adapter for querying the local LLaMA 3.2 via Ollama
   ├── scrapper.py            # Web scraper module for collecting Confluence docs (moved to root)
   ├── main.py                # Main entry point for ingesting documents and querying the system
   ├── requirements.txt       # List of required Python packages
   ├── tests/                 # Directory containing test scripts for the project
   │   ├── test_embedding.py  # Test file for embeddings functionality
   │   └── test_fiass.py      # Test file for FAISS-related functionality
   └── utils/                 # Directory for additional utilities and helper functions
       └── utils.py           # Other helper functions (e.g., cleaning, file operations)

Getting Started

Prerequisites

  • Python 3.9.6+
  • Ollama: Install and run Ollama on your local machine to serve your LLaMA 3.2 model.
  • Required Python Packages: See the Installation section below.

Installation

  1. Clone the Repository:

    git clone https://github.com/rohit1223/ConfluAi.git
    cd ConfluAi
  2. Create a Virtual Environment and Activate It:

    python3 -m venv venv
    source venv/bin/activate  # On Windows: venv\Scripts\activate
  3. Create the Required Directory and Files:

    mkdir confluence_docs  # must match DOCS_DIR in config.py
    touch base_urls.txt
  4. Populate base_urls.txt with the URLs you want to scrape. For example:

    https://example.com/
    https://dummyjson.com/docs
    https://www.postman.com/api-platform/api-testing/
  5. Install Dependencies:

    pip install -r requirements.txt

Configuration

Edit config.py to configure your paths and models. For example:

# config.py
DOCS_DIR = "./confluence_docs"         # Directory with your cleaned Confluence documents (.txt/.md)
FAISS_INDEX_PATH = "faiss_index"         # Directory for persisting the FAISS index and associated JSON files
EMBEDDING_DIM = 384                      # Embedding dimension (adjust based on your model)

OLLAMA_API_URL = "http://localhost:11434/api/generate"  # Ollama API endpoint
OLLAMA_MODEL_NAME = "llama3.2:latest"      # Specify your local LLaMA model (adjust as needed)

VALID_LINK_SUBSTRING = "Content/<YourContent>"
VALID_LINK_EXTENSION = ".htm"
EXCLUDE_EXTENSIONS = ['.pdf', '.zip']
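
For reference, a minimal sketch of how an adapter like llm_adapter.py could call this endpoint, assuming Ollama's standard /api/generate request shape (the project's actual adapter may differ):

    # Sketch of a call to the local Ollama endpoint configured above.
    import requests

    from config import OLLAMA_API_URL, OLLAMA_MODEL_NAME

    def generate(prompt: str) -> str:
        payload = {"model": OLLAMA_MODEL_NAME, "prompt": prompt, "stream": False}
        resp = requests.post(OLLAMA_API_URL, json=payload, timeout=120)
        resp.raise_for_status()
        return resp.json()["response"]  # Ollama returns the completion here

    print(generate("Reply with the single word: ready"))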

Ingesting Documents and Building the Index

  1. Prepare Your Data:

    1. Create the directory specified by DOCS_DIR in config.py and place your cleaned Confluence documents (plain text or Markdown) there.
    2. Create a file called base_urls.txt and populate it with the list of URLs to scrape.
    3. Run scrapper.py, a standalone module that must be executed separately to populate DOCS_DIR:
      python3 scrapper.py
  2. Build the Index: In main.py, uncomment the build_index() call and run:

    python3 main.py

    This will:

    1. Load, clean, and chunk your documents.

    2. Compute embeddings using your Hugging Face model.

    3. Build a FAISS index with LlamaIndex.

    4. Persist the index (including docstore, vector store, and metadata) to the directory specified by FAISS_INDEX_PATH.
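
    Under the hood, the indexing step might look roughly like this (a hedged sketch assuming llama-index's FAISS and Hugging Face integrations; indexer.py may differ in detail):

      # Hypothetical sketch of build_index() using llama-index + FAISS.
      import faiss
      from llama_index.core import (SimpleDirectoryReader, Settings,
                                    StorageContext, VectorStoreIndex)
      from llama_index.embeddings.huggingface import HuggingFaceEmbedding
      from llama_index.vector_stores.faiss import FaissVectorStore

      from config import DOCS_DIR, EMBEDDING_DIM, FAISS_INDEX_PATH

      # Embed locally so no external API key is needed.
      Settings.embed_model = HuggingFaceEmbedding(
          model_name="BAAI/bge-small-en-v1.5")

      documents = SimpleDirectoryReader(DOCS_DIR).load_data()
      vector_store = FaissVectorStore(
          faiss_index=faiss.IndexFlatL2(EMBEDDING_DIM))
      storage_context = StorageContext.from_defaults(vector_store=vector_store)
      index = VectorStoreIndex.from_documents(
          documents, storage_context=storage_context)
      index.storage_context.persist(persist_dir=FAISS_INDEX_PATH)  # JSON files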

  3. Query the Index:

    After building the index:

    1. Disable Index Building: Comment out or remove the build_index() call in main.py.

    2. Query the System: Modify the sample question if needed, then run:

    python3 main.py 

    This loads the persisted index, retrieves relevant context, assembles a prompt, and uses your local LLaMA 3.2 (via Ollama) to generate an answer.
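
    Roughly, this query path could be sketched as follows (assuming llama-index's persisted-FAISS loading and its Ollama integration; main.py and llm_adapter.py may differ):

      # Hypothetical sketch of loading the persisted index and querying it.
      from llama_index.core import (Settings, StorageContext,
                                    load_index_from_storage)
      from llama_index.embeddings.huggingface import HuggingFaceEmbedding
      from llama_index.llms.ollama import Ollama
      from llama_index.vector_stores.faiss import FaissVectorStore

      from config import FAISS_INDEX_PATH, OLLAMA_MODEL_NAME

      Settings.embed_model = HuggingFaceEmbedding(
          model_name="BAAI/bge-small-en-v1.5")
      Settings.llm = Ollama(model=OLLAMA_MODEL_NAME)  # local Ollama server

      vector_store = FaissVectorStore.from_persist_dir(FAISS_INDEX_PATH)
      storage_context = StorageContext.from_defaults(
          vector_store=vector_store, persist_dir=FAISS_INDEX_PATH)
      index = load_index_from_storage(storage_context)
      print(index.as_query_engine().query("How do I rotate an API key?"))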

Troubleshooting

  1. Persisted Files Missing: Ensure that you have run the indexing phase successfully. Your persist directory (FAISS_INDEX_PATH) should contain several JSON files such as default__vector_store.json, docstore.json, and index_store.json.

  2. Ollama Issues: Verify that your Ollama instance is running on http://localhost:11434 and that your specified model is loaded.

  3. LLM API Key Errors: Since this project uses a local LLM via Ollama, no external API keys (like OpenAI’s) are required. If you see errors related to API keys, ensure your configuration in config.py is correct.

Contributing

Contributions are welcome! Please open an issue or submit a pull request for any improvements, bug fixes, or new features.

License

This project is licensed under the MIT License.
