A command-line interface tool that implements Retrieval-Augmented Generation (RAG) using Google's Gemini API. It lets you ask questions about the contents of text files, PDF documents, and Python codebases.
- Process single documents or entire directories
- Supports `.txt`, `.pdf`, and `.py` files
- Uses Google's Gemini API for text generation
- Implements RAG pattern using FAISS vector store
- Interactive Q&A interface
- Lets you set how many relevant chunks are retrieved per query: request more chunks when you need a more specific, detail-rich answer (see the sketch below)
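In LangChain-based pipelines, this chunk count typically corresponds to the retriever's `k` parameter. A minimal sketch, assuming a FAISS vector store like the one described below; the helper name and default value are illustrative, not taken from `rag_cli.py`:

```python
from langchain_community.vectorstores import FAISS


def make_retriever(vector_store: FAISS, k: int = 4):
    """Return a retriever that fetches the top-k most relevant chunks per query."""
    # A larger k passes broader context to Gemini, which helps when the answer
    # needs more supporting detail, at the cost of a longer prompt.
    return vector_store.as_retriever(search_kwargs={"k": k})
```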
- Python 3.8 or higher
- Google API key with access to Gemini API
- Clone the repository
- Create a virtual environment:
  python -m venv venv
  source venv/bin/activate  # On Windows: venv\Scripts\activate
- Install dependencies:
pip install langchain langchain-google-genai python-dotenv faiss-cpu pypdf
- Create a `.env` file in the project root:
  GOOGLE_API_KEY=your_api_key_here
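The Google integrations in LangChain pick the key up from the `GOOGLE_API_KEY` environment variable; a minimal sketch of loading it with `python-dotenv` (how `rag_cli.py` actually does this is an assumption):

```python
import os

from dotenv import load_dotenv

load_dotenv()  # reads variables from the .env file into the process environment
if not os.getenv("GOOGLE_API_KEY"):
    raise SystemExit("GOOGLE_API_KEY is not set; add it to your .env file.")
```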
Run the tool on a single document or on an entire directory:

python rag_cli.py path/to/your/document.txt
python rag_cli.py path/to/your/directory
After processing the document(s), you can:
- Type your questions about the document content or codebase
- Get AI-generated answers based on the document context
- Type 'quit' or 'exit' to end the session
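A minimal sketch of such a loop, where `answer_question` is a hypothetical callable standing in for the retrieval and generation steps described below:

```python
def run_interactive_session(answer_question) -> None:
    """Read questions from stdin until the user types 'quit' or 'exit'."""
    while True:
        question = input("\nAsk a question (or 'quit' to exit): ").strip()
        if question.lower() in {"quit", "exit"}:
            print("Goodbye!")
            break
        if not question:
            continue
        print(answer_question(question))
```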
Under the hood, the tool works in three stages:

- Document Processing:
  - Loads documents using LangChain's document loaders
  - Splits documents into manageable chunks
- Vector Store Creation:
  - Creates embeddings using Google's embedding model
  - Stores the embeddings in a FAISS vector store
- Query Processing:
  - Retrieves the most relevant document chunks for each query
  - Generates contextual answers using the Gemini API (see the end-to-end sketch below)
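A minimal end-to-end sketch of these three stages, assuming the dependencies listed above (import paths, model names, and the overall structure are illustrative and may not match `rag_cli.py` or your installed LangChain version exactly):

```python
from dotenv import load_dotenv
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain_community.document_loaders import PyPDFLoader
from langchain_community.vectorstores import FAISS
from langchain_google_genai import ChatGoogleGenerativeAI, GoogleGenerativeAIEmbeddings

load_dotenv()  # makes GOOGLE_API_KEY available to the Google integrations

# 1. Document processing: load a file and split it into overlapping chunks.
loader = PyPDFLoader("path/to/your/document.pdf")  # TextLoader handles .txt/.py
chunks = RecursiveCharacterTextSplitter(
    chunk_size=1000, chunk_overlap=200
).split_documents(loader.load())

# 2. Vector store creation: embed the chunks and index them with FAISS.
embeddings = GoogleGenerativeAIEmbeddings(model="models/embedding-001")
vector_store = FAISS.from_documents(chunks, embeddings)

# 3. Query processing: retrieve relevant chunks, then ask Gemini to answer from them.
retriever = vector_store.as_retriever(search_kwargs={"k": 4})
llm = ChatGoogleGenerativeAI(model="gemini-1.5-flash")

question = "What is this document about?"
context = "\n\n".join(doc.page_content for doc in retriever.invoke(question))
prompt = (
    "Answer the question using only the context below.\n\n"
    f"Context:\n{context}\n\nQuestion: {question}"
)
print(llm.invoke(prompt).content)
```

Depending on your LangChain release, the loaders and FAISS wrapper may live in the separate `langchain-community` package, which you may need to install alongside `langchain`.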
The tool includes comprehensive error handling for:
- Invalid API keys
- Missing files or directories
- Unsupported file types
- Password-protected PDFs
- API rate limits
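The exception classes for encrypted PDFs and rate limits depend on the installed `pypdf` and Google client versions, so the sketch below only covers the input-side checks; the function name and messages are illustrative, not taken from `rag_cli.py`:

```python
import os
import sys

SUPPORTED_EXTENSIONS = {".txt", ".pdf", ".py"}


def validate_input_path(path: str) -> None:
    """Fail fast on missing paths and unsupported file types before any API calls."""
    if not os.path.exists(path):
        sys.exit(f"Error: '{path}' does not exist.")
    if os.path.isfile(path):
        extension = os.path.splitext(path)[1].lower()
        if extension not in SUPPORTED_EXTENSIONS:
            sys.exit(f"Error: unsupported file type '{extension}'.")
    # Password-protected PDFs and API rate limits are handled by catching the
    # exceptions raised by the PDF loader and the Gemini client at call time.
```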
- PDF processing ignores images
- Requires an active internet connection
- Subject to Google API rate limits