Transform your academic textbooks into an intelligent AI tutor using advanced RAG (Retrieval Augmented Generation) technology
Academic RAG Assistant is an intelligent tutoring system that transforms your course textbooks into an interactive AI conversation partner. Using advanced agentic RAG architecture, it provides contextual answers from your specific academic materials across multiple subjects including Linear Algebra, Discrete Structures, and Calculus & Analytical Geometry.
- Multi-Subject Expertise: Specialized tools for Linear Algebra, Discrete Structures, and Calculus & Analytical Geometry
- Agentic RAG System: Intelligent query enhancement and routing for optimal retrieval
- Real-time Streaming: Live response generation with visual typing indicators
- Modern UI: Clean, responsive dark-themed interface with custom styling
- Session Management: Persistent chat history with export functionality
- Model Selection: Support for multiple Google Gemini models (2.5 Pro, Flash, 2.0 Flash, etc.)
- Error Handling: Comprehensive error management with user-friendly messages
- Progress Tracking: Visual feedback during initialization and processing
- Smart Retrieval: MMR and multi-query retrieval strategies for better context
- Memory System: SQLite-based conversation persistence
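The memory system keeps each chat turn in SQLite so conversations survive page reloads. The snippet below is a minimal illustration of that idea with a hypothetical table layout, not the application's actual schema:

```python
import sqlite3
from datetime import datetime, timezone

# Minimal sketch of SQLite-backed chat persistence (illustrative schema).
conn = sqlite3.connect("chat_memory.db")
conn.execute(
    """CREATE TABLE IF NOT EXISTS messages (
           session_id TEXT,
           role       TEXT,   -- "user" or "assistant"
           content    TEXT,
           created_at TEXT
       )"""
)

def save_message(session_id: str, role: str, content: str) -> None:
    """Append one chat turn to the session history."""
    conn.execute(
        "INSERT INTO messages VALUES (?, ?, ?, ?)",
        (session_id, role, content, datetime.now(timezone.utc).isoformat()),
    )
    conn.commit()

def load_history(session_id: str) -> list[tuple[str, str]]:
    """Return (role, content) pairs in insertion order."""
    rows = conn.execute(
        "SELECT role, content FROM messages WHERE session_id = ? ORDER BY rowid",
        (session_id,),
    )
    return list(rows)
```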
- Frontend: Streamlit with custom CSS styling
- Backend: Python with asyncio for concurrent operations
- AI Framework: OpenAI Agents SDK for agentic behavior
- LLM Integration: Google Gemini via OpenAI-compatible API
- Vector Database: Pinecone for document storage and retrieval
- Embeddings: HuggingFace Sentence Transformers (all-MiniLM-L6-v2)
- Text Processing: LangChain framework for RAG implementation
- Memory: SQLite for session and conversation management
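To show how the retrieval pieces of this stack fit together, here is a minimal LangChain sketch. The index name matches the `semester-books` index mentioned under Troubleshooting; the namespace and the query are illustrative, and the search parameters mirror the configuration values listed further below:

```python
from langchain_huggingface import HuggingFaceEmbeddings
from langchain_pinecone import PineconeVectorStore

# Embed queries with the same sentence-transformer used for the textbook chunks.
embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-MiniLM-L6-v2")

# Connect to the pre-built Pinecone index (PINECONE_API_KEY read from the environment).
vector_store = PineconeVectorStore(
    index_name="semester-books",
    embedding=embeddings,
    namespace="linear_algebra",  # illustrative: one namespace per subject
)

# MMR retrieval trades relevance against diversity of the returned chunks.
retriever = vector_store.as_retriever(
    search_type="mmr",
    search_kwargs={"k": 2, "lambda_mult": 0.7},
)

docs = retriever.invoke("How do I compute the eigenvalues of a 2x2 matrix?")
```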
- Python 3.12 or higher
- Google Gemini API Key (Get one here)
- Pinecone API Key (Get one here)
- Pre-processed textbook data in Pinecone vector store
```bash
git clone https://github.com/ZohaibCodez/academic-rag-assistant.git
cd academic-rag-assistant

# Using pip
pip install -r requirements.txt

# Using uv (recommended)
uv sync

# Create .env file
cp .env.example .env
# Edit .env with your API keys

streamlit run app.py
```

Open your browser and navigate to http://localhost:8501
Create a .env file in the root directory:
```
GOOGLE_API_KEY=your_google_gemini_api_key_here
PINECONE_API_KEY=your_pinecone_api_key_here
```

Supported Gemini models:

- `gemini-2.5-pro` (Most capable, recommended for complex analysis)
- `gemini-2.5-flash` (Balanced performance and speed)
- `gemini-2.5-flash-lite` (Lightweight and fast)
- `gemini-2.0-flash` (Fast responses with good accuracy)
- `gemini-1.5-pro` (Reliable baseline model)
- `gemini-1.5-flash` (Quick processing)
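These models are reached through Gemini's OpenAI-compatible endpoint (see LLM Integration above). A minimal sketch, assuming the endpoint URL from Google's OpenAI-compatibility documentation and an illustrative prompt:

```python
from openai import AsyncOpenAI

# Gemini models are served behind Google's OpenAI-compatible endpoint.
client = AsyncOpenAI(
    api_key="your_google_gemini_api_key_here",
    base_url="https://generativelanguage.googleapis.com/v1beta/openai/",
)

async def ask(model: str, question: str) -> str:
    """Send one question to the selected Gemini model and return its answer."""
    response = await client.chat.completions.create(
        model=model,  # e.g. "gemini-2.5-flash"
        messages=[{"role": "user", "content": question}],
    )
    return response.choices[0].message.content
```

Retrieval behavior is tuned by the configuration constants shown next.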
```python
CHUNK_OVERLAP = 100          # Text chunk overlap for context
RETRIEVER_K_MMR = 2          # MMR retrieval count
RETRIEVER_K_SIMILARITY = 5   # Similarity search count
LAMBDA_MUL = 0.7             # MMR diversity parameter
EMBEDDING_MODEL = "sentence-transformers/all-MiniLM-L6-v2"
```

- Enter API Keys: Add your Google Gemini and Pinecone API keys in the sidebar
- Select Model: Choose your preferred Gemini model from the dropdown
- Start Learning: Ask questions about your coursework in natural language
- View Subjects: Check available subjects in the "Subjects" tab
- Export History: Download your conversation anytime from the "Info" tab
- Matrix operations and properties
- Systems of linear equations (Gaussian elimination, substitution)
- Eigenvalues and eigenvectors
- Vector spaces and transformations
- Determinants and matrix inverses
- Mathematical logic and proof techniques
- Set theory and relations
- Graph theory and trees
- Combinatorics and counting principles
- Boolean algebra and functions
- Limits and continuity
- Differentiation techniques and applications
- Integration methods and applications
- Analytical geometry in 2D and 3D
- Sequences and series
- "Explain the steps to solve a system of linear equations using Gaussian elimination"
- "What is mathematical induction and how do I write a proof?"
- "How do you find the derivative of a composite function using chain rule?"
- "What are eigenvalues and eigenvectors? Provide examples"
- "Explain the fundamental theorem of calculus with applications"
```
┌─────────────────┐    ┌─────────────────┐    ┌─────────────────┐
│   User Query    │───▶│ Query Enhancer  │───▶│ Subject Router  │
│   (Natural)     │    │ (Agent System)  │    │ (Classification)│
└─────────────────┘    └─────────────────┘    └─────────────────┘
                                                       │
┌─────────────────┐    ┌─────────────────┐    ┌─────────────────┐
│  Streamlit UI   │◀───│  Agent Runner   │◀───│ Function Tools  │
│   (Frontend)    │    │ (Orchestrator)  │    │  (Subject RAG)  │
└─────────────────┘    └─────────────────┘    └─────────────────┘
        │                      │                       │
┌─────────────────┐    ┌─────────────────┐    ┌─────────────────┐
│  Session Store  │    │  Gemini Models  │    │ Pinecone Vector │
│    (SQLite)     │    │ (Generation AI) │    │    Database     │
└─────────────────┘    └─────────────────┘    └─────────────────┘
                                                       │
                                              ┌─────────────────┐
                                              │   HuggingFace   │
                                              │   Embeddings    │
                                              └─────────────────┘
```
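A compressed sketch of the agentic layer in the diagram, wiring the OpenAI Agents SDK to Gemini through its OpenAI-compatible endpoint. The tool body, instructions, and names here are placeholders, not the project's actual implementation:

```python
from agents import Agent, Runner, OpenAIChatCompletionsModel, function_tool, set_tracing_disabled
from openai import AsyncOpenAI

set_tracing_disabled(True)  # skip OpenAI-hosted tracing in this sketch

gemini_client = AsyncOpenAI(
    api_key="your_google_gemini_api_key_here",
    base_url="https://generativelanguage.googleapis.com/v1beta/openai/",
)

@function_tool
def linear_algebra_rag(query: str) -> str:
    """Retrieve relevant Linear Algebra textbook passages for the query."""
    # The real app would query the Pinecone vector store here;
    # a canned string keeps the sketch self-contained.
    return "Retrieved textbook context for: " + query

tutor = Agent(
    name="Academic Tutor",
    instructions=(
        "Enhance the student's question, pick the matching subject tool, "
        "and answer strictly from the retrieved textbook context."
    ),
    model=OpenAIChatCompletionsModel(
        model="gemini-2.5-flash",
        openai_client=gemini_client,
    ),
    tools=[linear_algebra_rag],  # one tool per subject in practice
)

result = Runner.run_sync(tutor, "What are eigenvalues and eigenvectors?")
print(result.final_output)
```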
```bash
# Create .env file with your API keys
echo "GOOGLE_API_KEY=your-gemini-key-here" >> .env
echo "PINECONE_API_KEY=your-pinecone-key-here" >> .env

# Build and run
docker build -t academic-rag-assistant .
docker run -p 8501:8501 --env-file .env academic-rag-assistant
```

```
academic-rag-assistant/
│
├── app.py                              # Main Streamlit application
│
├── notebooks/
│   └── data_preparation_pipeline.ipynb # Complete RAG pipeline setup
│       ├── Step 1a – Multi-Document Ingestion
│       ├── Step 1b – Subject-Aware Text Splitting
│       ├── Step 2  – Retrieval System Setup
│       ├── Step 3  – Tool Definitions
│       └── Agentic RAG Final Form
│
├── logs/                               # Application logs directory
├── Dockerfile                          # Container configuration
├── requirements.txt                    # Python dependencies
├── pyproject.toml                      # Project configuration
├── uv.lock                             # UV dependency lock
├── .env.example                        # Example environment variables
├── .gitignore                          # Git ignore rules
└── README.md                           # Project documentation
```
- Query Processing: ~1-3 seconds for typical academic queries
- Memory Usage: Optimized vector storage with Pinecone
- Retrieval Accuracy: High precision with multi-strategy retrieval
- Response Quality: Enhanced by agentic query reformulation
- Concurrent Users: Supports multiple simultaneous sessions
- Streaming Speed: Real-time response generation with 0.05s intervals
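For reference, the streaming display pattern with ~0.05 s repaint intervals can be sketched in Streamlit like this; the token generator is a stand-in for the real model stream:

```python
import time
import streamlit as st

def fake_token_stream(text: str):
    """Stand-in for the model's streaming output."""
    for word in text.split():
        yield word + " "

placeholder = st.empty()
shown = ""
for token in fake_token_stream("Eigenvalues are the scalars that scale eigenvectors..."):
    shown += token
    placeholder.markdown(shown + "▌")  # typing indicator
    time.sleep(0.05)                   # ~0.05 s repaint interval
placeholder.markdown(shown)            # final render without the cursor
```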
- Subject Scope: Limited to three core subjects (Linear Algebra, Discrete Structures, Calculus)
- Language: Optimized for English academic content
- Data Dependency: Requires pre-processed textbooks in Pinecone
- API Limits: Subject to Google Gemini and Pinecone rate limits
- Context Window: Limited by model context length for very long documents
The included Jupyter notebook (data_preparation_pipeline.ipynb) provides a complete walkthrough:
- PDF text extraction and processing
- Document metadata handling
- Quality validation and cleanup
- Intelligent chunking based on academic structure
- Subject-specific namespace organization
- Context preservation across chunks
- Vector store initialization with Pinecone
- Embedding model configuration
- Index creation and optimization
- Subject-specific RAG function tools
- Query enhancement and routing logic
- Response formatting and validation
- Complete agent system integration
- Testing and validation procedures
- Performance optimization techniques
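A condensed sketch of that pipeline using LangChain loaders and splitters; the file path, chunk size, and namespace are illustrative, while the overlap matches CHUNK_OVERLAP above:

```python
from langchain_community.document_loaders import PyPDFLoader
from langchain_text_splitters import RecursiveCharacterTextSplitter
from langchain_huggingface import HuggingFaceEmbeddings
from langchain_pinecone import PineconeVectorStore

# 1. Ingest one textbook (path illustrative).
pages = PyPDFLoader("textbooks/linear_algebra.pdf").load()

# 2. Split into chunks; chunk_size is illustrative, overlap mirrors CHUNK_OVERLAP.
splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100)
chunks = splitter.split_documents(pages)
for chunk in chunks:
    chunk.metadata["subject"] = "linear_algebra"

# 3. Embed and upsert into the subject's namespace of the "semester-books" index.
embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-MiniLM-L6-v2")
PineconeVectorStore.from_documents(
    chunks,
    embedding=embeddings,
    index_name="semester-books",
    namespace="linear_algebra",
)
```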
Contributions are welcome! Please feel free to submit a Pull Request. For major changes, please open an issue first to discuss what you would like to change.
- Fork the repository
- Create a feature branch (`git checkout -b feature/amazing-feature`)
- Install development dependencies (`uv sync`)
- Make your changes with comprehensive logging
- Test across multiple Gemini models
- Commit your changes (`git commit -m 'Add amazing feature'`)
- Push to the branch (`git push origin feature/amazing-feature`)
- Open a Pull Request
- Follow PEP 8 style guidelines
- Add comprehensive logging for new features
- Include error handling for all external API calls
- Update documentation for new functionality
- Test with multiple subjects and query types
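One simple pattern that satisfies the logging and error-handling guidelines; the logger name and wrapper are illustrative, not project code:

```python
import logging

logger = logging.getLogger("academic_rag_assistant")  # illustrative logger name

def safe_external_call(description: str, call, *args, **kwargs):
    """Run an external API call, logging failures instead of crashing the UI."""
    try:
        logger.info("Calling %s", description)
        return call(*args, **kwargs)
    except Exception:
        logger.exception("%s failed", description)
        return None
```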
- Support for additional subjects (Physics, Chemistry, Statistics)
- Multi-language academic content support
- Advanced visualization tools for mathematical concepts
- Integration with learning management systems
- Collaborative study session features
- Custom textbook upload and processing
- Mobile-responsive design improvements
- Voice interaction capabilities
- Progress tracking and learning analytics
- API endpoint for programmatic access
- Large textbook corpora may require extended initialization time
- Complex mathematical notation may not render perfectly
- API rate limiting may affect performance during peak usage
- Memory usage can be high with multiple concurrent users
"Agent initialization failed" error:
- Verify both Google Gemini and Pinecone API keys are valid
- Check internet connectivity and API service status
- Ensure sufficient API quota remaining
"Vector store connection failed":
- Confirm Pinecone API key and index configuration
- Verify the "semester-books" index exists with correct namespaces
- Check Pinecone service status and regional settings
Slow response times:
- Try switching to gemini-2.5-flash for faster responses
- Check your network connection stability
- Consider using a different model if quota limits are reached
Memory errors:
- Restart the Streamlit application
- Clear browser cache and session storage
- For Docker: increase memory allocation limits
This project is licensed under the GNU General Public License v3.0 - see the LICENSE file for details.
- OpenAI Agents SDK for the agentic framework
- Streamlit for the incredible web framework
- LangChain for comprehensive RAG implementation
- Google AI for Gemini API access
- Pinecone for scalable vector database services
- HuggingFace for open-source embedding models
- Academic community for inspiration and feedback
If you encounter any issues or have questions:
- Open an Issue
- Check existing issues for solutions
- Review the troubleshooting section above
- Contact: itxlevicodez@gmail.com
⭐ Star this repository if you found it helpful for your academic journey!
Built for students by @ZohaibCodez using Google Gemini AI and advanced RAG techniques