RAG-Based Chatbot

🚀 Overview

This project is a Retrieval-Augmented Generation (RAG) chatbot built with Next.js 15+ (App Router), Groq API, Weaviate vector database, Puppeteer, and Redux for state management. The chatbot scrapes content from user-provided URLs, stores it in a vector database, and generates intelligent responses with real-time citations.

🎯 Features

RAG Architecture: Enhances chatbot responses with relevant retrieved content.
Source Citations: Displays clickable links to the original sources.
Real-Time Scraping: Uses Puppeteer for web scraping.
Weaviate for Vector Storage: Efficient storage and retrieval of knowledge.
Groq API for AI Responses: Generates accurate, context-aware responses.
Redux for State Management: Handles chat history efficiently.
Next.js App Router: Optimized for server-side and client-side rendering.

🛠️ Tech Stack

Frontend: Next.js 15+, React, Tailwind CSS, Redux
Backend: Node.js, Express, Puppeteer, Weaviate, Groq API
Database: Weaviate (vector storage)
Deployment: Vecerl

📦 Installation

1️⃣ Clone the Repository

git clone https://github.com/TheUzair/RAG-Chatbot.git
cd rag-chatbot

2️⃣ Install Dependencies

npm install

3️⃣ Set Up Environment Variables

Create a .env.local file and add:

GROQ_API_KEY=your-groq-api-key
WEAVIATE_URL=your-weaviate-instance-url
WEAVIATE_API_KEY=your-weaviate-api-key

4️⃣ Run the Application

npm run dev

Access the chatbot at http://localhost:3000.

🔧 How It Works

1️⃣ Scraping a Website

User enters a URL.
Puppeteer extracts text content.
Data is stored in Weaviate as vector embeddings.

2️⃣ Query Processing

User asks a question.
Weaviate retrieves the most relevant content.
Groq API generates a response using both retrieved and general knowledge.

3️⃣ Response Display

The chatbot displays the response.
If a source is available, a clickable citation is shown.

💬 Example Responses

✅ Example 1: Factual Query

User: "What is quantum computing?" Bot: "Quantum computing uses quantum bits (qubits) to perform computations that classical computers struggle with. Click here to read more."

✅ Example 2: Context-Aware Query

User: "Summarize the main points of this article." Bot: "The article discusses the latest AI trends, focusing on ethical concerns and advancements. Click here for details."

🛑 Edge Cases

✅ Handled Cases

Empty Queries: Prevents sending blank messages.
Invalid URLs: Displays an error if an incorrect URL is entered.
Slow Responses: Shows a loading indicator while fetching results.
Rate Limits: Implements retry mechanisms.

❌ Unhandled Cases

JavaScript-Rendered Pages: If content is hidden behind client-side rendering, Puppeteer may not capture it.
Highly Dynamic Content: Pages that change frequently may lead to outdated embeddings.
Multimodal Inputs: Currently, only text-based queries are supported.

📌 Future Enhancements

✅ Add support for multimedia content retrieval (images, videos).
✅ Improve citation handling with multiple source references.
✅ Enhance multi-turn conversations.
✅ Expand language support for global users.

🤝 Contributing

Pull requests are welcome! Feel free to fork the repo and submit your contributions.

📄 License

This project is licensed under the MIT License. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
app		app
public		public
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md
eslint.config.mjs		eslint.config.mjs
jsconfig.json		jsconfig.json
next.config.mjs		next.config.mjs
package-lock.json		package-lock.json
package.json		package.json
postcss.config.mjs		postcss.config.mjs
tailwind.config.mjs		tailwind.config.mjs
vercel.json		vercel.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

RAG-Based Chatbot

🚀 Overview

🎯 Features

🛠️ Tech Stack

📦 Installation

1️⃣ Clone the Repository

2️⃣ Install Dependencies

3️⃣ Set Up Environment Variables

4️⃣ Run the Application

🔧 How It Works

1️⃣ Scraping a Website

2️⃣ Query Processing

3️⃣ Response Display

💬 Example Responses

✅ Example 1: Factual Query

✅ Example 2: Context-Aware Query

🛑 Edge Cases

✅ Handled Cases

❌ Unhandled Cases

📌 Future Enhancements

🤝 Contributing

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

TheUzair/RAG-Chatbot

Folders and files

Latest commit

History

Repository files navigation

RAG-Based Chatbot

🚀 Overview

🎯 Features

🛠️ Tech Stack

📦 Installation

1️⃣ Clone the Repository

2️⃣ Install Dependencies

3️⃣ Set Up Environment Variables

4️⃣ Run the Application

🔧 How It Works

1️⃣ Scraping a Website

2️⃣ Query Processing

3️⃣ Response Display

💬 Example Responses

✅ Example 1: Factual Query

✅ Example 2: Context-Aware Query

🛑 Edge Cases

✅ Handled Cases

❌ Unhandled Cases

📌 Future Enhancements

🤝 Contributing

📄 License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages