A powerful document question-answering application featuring advanced Retrieval-Augmented Generation (RAG) capabilities with Google's Gemini 2.0 Flash model.
- Relevance Evaluation: Automatically evaluates how relevant each context chunk is to the query
- Context Filtering: Removes less relevant information to improve response quality
- Sufficiency Analysis: Determines if the retrieved context is sufficient to answer the query
- Adaptive Retrieval: Retrieves additional context when needed
- Query Reformulation: Transforms user queries for more effective retrieval
- Iterative Analysis: Multiple rounds of context analysis and improvement
- Follow-up Query Generation: Generates specific queries to fill information gaps
- Context Synthesis: Creates optimized context by combining and reorganizing information
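The relevance-evaluation and context-filtering steps above can be sketched in a few lines. This is a toy illustration only: the application scores chunks with Gemini rather than the word-overlap heuristic used here, and `relevance_score`, `filter_context`, and the `0.5` threshold are hypothetical names, not the project's actual code.

```python
def relevance_score(query: str, chunk: str) -> float:
    """Toy stand-in for LLM relevance evaluation: the fraction of
    query words that also appear in the chunk."""
    query_words = set(query.lower().split())
    chunk_words = set(chunk.lower().split())
    return len(query_words & chunk_words) / len(query_words) if query_words else 0.0

def filter_context(query: str, chunks: list[str], threshold: float = 0.5) -> list[str]:
    """Keep only the chunks whose relevance meets the threshold."""
    return [chunk for chunk in chunks if relevance_score(query, chunk) >= threshold]
```

In the real pipeline the scoring call would go to the LLM, but the filter-by-threshold shape stays the same.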
- Supports PDF, DOCX, and TXT documents
- Automatically chunks documents for improved retrieval
- Semantic search using Google's embedding model
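The chunking step might look like the fixed-size sliding window below. This is a sketch: the actual chunk size, overlap, and function name used in this project are not shown in this README and are assumptions here.

```python
def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into overlapping character windows; the overlap
    preserves context across chunk boundaries."""
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        start += chunk_size - overlap
    return chunks
```

Overlapping windows are a common default; token-aware or sentence-aware splitting is an alternative when chunk boundaries matter.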
- Flask Web Application: Lightweight web interface with responsive design
- Modular Components: Separate modules for document processing, embedding, retrieval, and generation
- In-Memory Storage: Session-based storage for document embeddings
- Gemini 2.0 Flash: Leverages Google's latest LLM for intelligent RAG operations
- `app.py`: The main Flask application file. Defines routes for:
  - `/`: Homepage; renders the `index.html` template.
  - `/upload`: Handles document uploads, processes documents, creates embeddings, and saves them.
  - `/query`: Handles user queries, retrieves relevant context using either Self-RAG or Agentic RAG, generates responses using Gemini, and returns them along with source information and RAG metrics.
- `main.py`: Entry point to run the Flask application.
- `pyproject.toml`: Project configuration file, including dependencies.
- `/utils`: Core RAG functionality
  - `agentic_rag.py`: Autonomous RAG agent implementation
  - `document_processor.py`: Document parsing and chunking
  - `embedding.py`: Document and query embedding functions
  - `gemini_integration.py`: Integration with Gemini models
  - `retrieval.py`: Semantic search functionality
- `/static`: Frontend assets
  - `/css`: Stylesheets
  - `/js`: JavaScript files
- `/templates`: HTML templates
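The semantic search in `retrieval.py` presumably ranks chunk embeddings by similarity to the query embedding. A minimal pure-Python cosine-similarity sketch follows; the function names and implementation are illustrative, not the project's actual code.

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def top_k(query_vec: list[float], chunk_vecs: list[list[float]], k: int = 3) -> list[int]:
    """Return the indices of the k chunks most similar to the query."""
    scored = [(cosine_similarity(query_vec, v), i) for i, v in enumerate(chunk_vecs)]
    return [i for _, i in sorted(scored, reverse=True)[:k]]
```

With NumPy (which the project depends on), the same ranking would usually be a single vectorized matrix-vector product.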
- Python 3.11+
- A valid Google API key for Gemini API access
- Clone the repository:

  ```bash
  git clone https://github.com/yourusername/GeminiRagAssistant.git
  cd GeminiRagAssistant
  ```

- Set up a virtual environment:

  ```bash
  python -m venv venv
  source venv/bin/activate  # On Windows: venv\Scripts\activate
  ```

- Install dependencies:

  ```bash
  pip install -r requirements.txt
  ```

- Set environment variables:

  ```bash
  export GOOGLE_API_KEY="your_google_api_key_here"
  export SESSION_SECRET="a_secure_random_string"
  ```

  On Windows:

  ```bash
  set GOOGLE_API_KEY=your_google_api_key_here
  set SESSION_SECRET=a_secure_random_string
  ```
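Inside the application these variables would typically be read via `os.environ`. A sketch of a fail-fast lookup follows; the `require_env` helper is hypothetical and not part of this project.

```python
import os

def require_env(name: str) -> str:
    """Read a required environment variable, failing fast with a clear
    message instead of erroring mid-request later."""
    value = os.environ.get(name)
    if not value:
        raise RuntimeError(f"{name} is not set; see the setup steps above")
    return value
```

Failing at startup makes a missing `GOOGLE_API_KEY` obvious immediately rather than surfacing as an opaque API error on the first query.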
The application requires the following Python libraries:
- Flask: Web framework
- Google Generative AI: Gemini API access
- PyPDF2: PDF processing
- docx2txt: DOCX processing
- NumPy: Numerical operations
- Werkzeug: Utility functions for web applications
You can install these dependencies using pip:
```bash
pip install Flask google-generativeai PyPDF2 docx2txt numpy werkzeug
```
Run the application with:
```bash
python main.py
```
The application will be available at http://localhost:5000
- Upload a Document:
  - Click on the "Upload Documents" section
  - Select a document (PDF, DOCX, or TXT format)
  - Wait for processing (the document will be chunked and embedded)
- Ask Questions:
  - Type your question in the query box
  - Select your preferred RAG mode:
    - Self-RAG: faster, with real-time relevance filtering
    - Agentic RAG: more thorough, with iterative improvements
  - Click "Ask" and wait for the response
- View the Response:
  - The answer is displayed in the response section
  - You can see which sources were used and their relevance
  - For Agentic RAG, you'll also see metrics such as context quality and follow-up queries
Example Workflows:

Self-RAG:
- User uploads a document and asks a question
- System retrieves initial context chunks based on semantic similarity
- Each chunk is evaluated for relevance to the query
- Low-relevance chunks are filtered out
- System analyzes if the filtered context is sufficient
- If needed, additional context is retrieved
- Final response is generated using the optimized context
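The Self-RAG steps above can be sketched as a small loop. This is illustrative only: `retrieve`, `evaluate`, and `generate` stand in for the real embedding search and Gemini calls, and the sufficiency check here is a toy.

```python
def self_rag(query, retrieve, evaluate, generate, threshold=0.5, max_rounds=2):
    """Sketch of the Self-RAG loop: retrieve chunks, keep only those
    scored relevant, and retrieve again if context looks insufficient."""
    context = []
    for round_no in range(max_rounds):
        chunks = retrieve(query, round_no)
        context += [c for c in chunks if evaluate(query, c) >= threshold]
        if len(context) >= 2:  # toy sufficiency check; the app asks the LLM
            break
    return generate(query, context)
```

The key shape is the filter-then-check loop: each round only adds chunks that pass relevance evaluation, and retrieval stops once the context is judged sufficient.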
Agentic RAG:
- User uploads a document and asks a question
- System reformulates the query to improve retrieval
- Initial context chunks are retrieved
- System analyzes context quality and identifies gaps
- Context chunks are prioritized by relevance
- System generates follow-up queries to fill gaps
- Additional context is retrieved using follow-up queries
- Context is synthesized into optimized form
- Final response is generated with detailed process metrics
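The agentic pipeline above, as a sketch: the callables stand in for the real Gemini-backed steps, and the names and control flow are assumptions, not the code in `agentic_rag.py`.

```python
def agentic_rag(query, reformulate, retrieve, find_gaps, synthesize, generate):
    """Sketch of the agentic pipeline: reformulate the query, retrieve,
    fill identified gaps with follow-up queries, synthesize, then answer."""
    working_query = reformulate(query)            # query reformulation
    context = retrieve(working_query)             # initial retrieval
    for follow_up in find_gaps(query, context):   # gap analysis
        context += retrieve(follow_up)            # adaptive retrieval
    return generate(query, synthesize(context))   # final response
```

Compared with the Self-RAG loop, the agent actively rewrites the query up front and issues targeted follow-up queries for each gap, rather than simply retrieving more of the same.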
- Built with Google's Gemini 2.0 Flash model
- Inspired by research on Self-RAG and Agentic RAG approaches