This project provides a REST API built with FastAPI that enables question answering over PDF documents using LangChain. It allows users to ask questions, which are answered by a Large Language Model (LLM) based on the information contained in a predefined PDF document.
- ✅ Load and split content from a PDF file
- ✅ Lexical retrieval using BM25 and TF-IDF
- ✅ Re-ranking with Flashrank
- ✅ Use of EnsembleRetriever + ContextualCompressionRetriever (see the retrieval sketch after this list)
- ✅ RAG pipeline powered by LangChain
- ✅ HTTP endpoint built with FastAPI
- ✅ Configuration via `.env` file
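
The sketch below shows one way this retrieval stack can be wired together with LangChain. It is illustrative only: the file path, chunk sizes, and ensemble weights are assumptions, import paths may vary between LangChain versions, and the repository's `main.py` may do things differently. BM25Retriever needs `rank_bm25`, TFIDFRetriever needs `scikit-learn`, and FlashrankRerank needs the `flashrank` package.

```python
# Illustrative sketch of the retrieval stack listed above -- not this repo's exact code.
import os

from langchain_community.document_loaders import PyPDFLoader
from langchain_community.retrievers import BM25Retriever, TFIDFRetriever
from langchain_text_splitters import RecursiveCharacterTextSplitter
from langchain.retrievers import ContextualCompressionRetriever, EnsembleRetriever
from langchain.retrievers.document_compressors import FlashrankRerank

# 1. Load the PDF and split it into chunks (path and sizes are placeholders).
docs = PyPDFLoader(os.environ.get("PDF_PATH", "document.pdf")).load()
splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100)
chunks = splitter.split_documents(docs)

# 2. Lexical retrieval: BM25 and TF-IDF combined by an EnsembleRetriever.
bm25 = BM25Retriever.from_documents(chunks, k=5)
tfidf = TFIDFRetriever.from_documents(chunks, k=5)
ensemble = EnsembleRetriever(retrievers=[bm25, tfidf], weights=[0.5, 0.5])

# 3. Re-ranking: Flashrank wrapped in a ContextualCompressionRetriever.
reranker = FlashrankRerank()  # a specific model can be set via its `model` field
retriever = ContextualCompressionRetriever(base_compressor=reranker, base_retriever=ensemble)

# The compressed, re-ranked documents then feed the RAG chain that prompts the LLM.
top_docs = retriever.invoke("What is this document about?")
```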
- Python 3.10+
- LangChain
- FastAPI
- OpenAI / LM Studio (see the client sketch after this list)
- BM25 & TF-IDF retrievers
- PyPDFLoader
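
Because LM Studio exposes an OpenAI-compatible API, the same chat client can point at either backend. A minimal sketch is shown below, assuming the `LLM_API_KEY` and `LLM_API_BASE` variables from the configuration section further down; the model name is a placeholder and the actual wiring in `main.py` may differ.

```python
# Hypothetical sketch: one ChatOpenAI client for both OpenAI and a local LM Studio server.
import os
from langchain_openai import ChatOpenAI

llm = ChatOpenAI(
    api_key=os.environ["LLM_API_KEY"],    # any non-empty string is accepted by LM Studio
    base_url=os.environ["LLM_API_BASE"],  # e.g. https://api.openai.com/v1 or http://localhost:1234/v1
    model="gpt-4o-mini",                  # placeholder; use whichever model the server exposes
)
print(llm.invoke("Hello").content)
```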
```bash
git clone https://github.com/kullaniciadi/rag-pdf-qa.git
cd rag-pdf-qa
```
```bash
python -m venv venv
source venv/bin/activate
pip install -r requirements.txt
```
To run this project, you will need to add the following environment variables to your `.env` file:

- `PDF_PATH` – path to the PDF document to load
- `LLM_API_KEY` – API key for the LLM provider
- `LLM_API_BASE` – base URL of the OpenAI-compatible API (OpenAI or LM Studio)
- `RERANKER_MODEL` – re-ranker model used by Flashrank
- `EMBEDDING_MODEL` – embedding model name
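
For example, a minimal `.env` might look like the following. All values are placeholders; check the code for the names and defaults it actually expects.

```env
PDF_PATH=./docs/sample.pdf
# Any non-empty string works if you use LM Studio locally
LLM_API_KEY=sk-...
LLM_API_BASE=http://localhost:1234/v1
RERANKER_MODEL=ms-marco-MiniLM-L-12-v2
EMBEDDING_MODEL=text-embedding-3-small
```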
```bash
uvicorn main:app --reload
```
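
Once the server is running, you can post a question to the API. The route (`/ask`) and payload shape below are hypothetical; check `main.py` for the endpoint this project actually defines. FastAPI also serves interactive docs at `http://127.0.0.1:8000/docs`.

```bash
# Hypothetical request; adjust the path and JSON body to match main.py
curl -X POST http://127.0.0.1:8000/ask \
  -H "Content-Type: application/json" \
  -d '{"question": "What is the document about?"}'
```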