This project aims to implement a RAG-based Local Language Model (LLM) using a locally available dataset. The RAG (Retrieval-Augmented Generation) model combines the strengths of retriever and generator models, enabling more effective and contextually relevant language generation.
The Local Language Model (LLM) implemented in this project is designed to operate locally, ensuring that no sensitive data is leaked to the internet. The model utilizes the RAG architecture, which involves a retriever component for efficient information retrieval and a generator component for language generation.
- RAG Architecture: Integration of the RAG architecture for improved language generation based on local data.
- Ingestor Component: The ingestor component ingests the documents' information into the chromaDB vector database.
- Data Security: No data is sent or leaked to the internet, ensuring the privacy and security of locally available datasets.
- Retriever Component: The retriever component efficiently retrieves relevant information from the local dataset.
- Generator Component: The generator component utilizes the retrieved information to generate contextually relevant language.
Follow these steps to set up the RAG-based Local Language Model:
- Clone the Repository:
- Install Dependencies:
Save the documents that need to be used to the
SOURCE_DOCUMENTS
folder.- Run the Ingestor:
- Run the Model:
- Access the application in your web browser at
http://localhost:8501
)
git clone https://github.com/Sankethhhh/RAG-LLM.git
cd RAG-LLM
pip install -r requirements.txt
python ingest.py
streamlit run app.py
The RAG-based Local Language Model can be used for various natural language processing tasks, such as text completion, question answering, and content generation. Customize the app.py
script according to your specific use case.
The user interface (UI) for this project is based on Streamlit, providing a simple and interactive way to interact with the RAG-based Local Language Model.
This project is licensed under the MIT License.
- The RAG model concept is based on the work by Facebook AI Research.
- Special thanks to the localGPT open-source community for their valuable contributions.
Feel free to explore, experiment, and enhance the RAG-based Local Language Model for your specific use cases!