JanaSathi is a bilingual AI chatbot that helps users understand and access Odisha Government schemes. It uses a Retrieval-Augmented Generation (RAG) architecture to retrieve relevant content from official documents and generate responses in English and Odia.
- β Answers queries about government schemes like KALIA Yojana, Mission Shakti, and Biju Swasthya Kalyan Yojana
- π Reads and processes official government PDF documents
- π Uses Large Language Models via OpenRouter API
- π Retrieves relevant information using semantic search (MiniLM)
- π§ Generates responses in English and translates to Odia
- π₯οΈ Streamlit frontend for interactive usage
- Python 3.10+
- Streamlit
- Sentence-Transformers (
all-MiniLM-L6-v2) - OpenAI-compatible LLMs via OpenRouter
- Cohere models for Odia translation
- PyPDF2 for PDF parsing
- scikit-learn for cosine similarity
git clone https://github.com/your-username/odia-e-gov-chatbot.git
cd odia-e-gov-chatbotpip install -r requirements.txtPlace your scheme-related PDFs in the project root directory.
streamlit run streamlit_app.py- PDF documents are processed and chunked into smaller sections.
- Embeddings are generated using
all-MiniLM-L6-v2. - The chatbot retrieves top relevant chunks using cosine similarity.
- Uses a 49B LLM to generate responses in English.
- Translates the responses into Odia using Cohereβs LLM.
- Presents both responses through a clean Streamlit interface.
- Improve document chunking for regional formatting
- Add document upload feature in the UI
- Support voice-based queries and responses
- Log chat history for audit and learning
Anurag Pradhan
π§ anuragpradhancb@gmail.com
π LinkedIn β’ GitHub
This project is licensed under the MIT License.
