This repository implements a robust Retrieval-Augmented Generation (RAG) agentic workflow for technical document QA chat over a multi-source technical corpus (e.g., forums, PDFs, web, YouTube) related to Pepwave cellular routers. Pepwave routers are popular among digital nomads like myself, but they are aimed at network admins rather than laypeople, and ChatGPT is not helpful in answering most questions about them. The goal of this chatbot is to create an assistant that I and other digital nomads can use to troubleshoot issues and learn to optimize our routers. A yardstick for success is whether it can pass the Pepwave Certified Engineer Exam, which I have implemented as part of the evaluation framework.
This project is also meant to be a platform for experimenting with various AI Engineering techniques. To support this, I developed a rigorous, modular evaluation system that enables controlled experimentation and provides quantitative, explainable feedback on the impact of different modeling, retrieval, and data processing strategies.
This is also the first iteration of a larger project to create an OSS solution for quickly deploying a RAG chatbot for any given Discourse forum.
- BaseExtractor: Abstracts extraction logic for diverse sources (Reddit, YouTube, web, PDFs, Google Drive, MongoDB).
- A separate JavaScript repo performs the web scraping for the Pepwave forums, using the Discourse API to extract 30k posts and dump them into MongoDB.
- Enforces a consistent folder structure and streaming interface for raw data.
- Validates and serializes extracted data to JSONL files for reproducibility.
- BaseTransform: Standardizes and normalizes raw data into a unified schema for downstream processing.
- Handles subject-matter tagging, metadata normalization, and document formatting.
- Performs sophisticated quality filtering, especially for Reddit and forum posts, using statistical data-science techniques.
- Persists transformed data as parquet files for reproducibility.
- BaseLoad: Loads transformed data, applies deduplication that is highly customized to each dataset using a variety of techniques (MinHash, RapidFuzz, NLTK), and prepares documents for vector storage (a generic deduplication sketch follows this list).
- Integrates synthetic data via entity extraction (spaCy), LLM-driven summarization and theme extraction, and other techniques.
- Leverages the OpenAI Batch API to cut costs, permitting a more generous volume of synthetic data generation.
- Uploads documents to vector database (Pinecone).
- RagInference: Implements a modular, history-aware RAG pipeline using LangChain, OpenAI LLMs, and Pinecone vector search to provide a chat interface for users.
- RagInferenceLangGraph: Implements chat using a more complex LangGraph orchestration that leverages an agentic workflow to provide more reliable answers (see the graph sketch after this list).
- RAGAS: A fork of the RAGAS library, heavily customized for the specific needs of this project. See the GitHub repo aubford/ragas.
- Testset Generation: Multi-hop QA testset creation using a knowledge graph strategy and LLM-driven prompt synthesis, along with human refinement.
- RagasEval: End-to-end RAG evaluation with metrics for context recall, precision, faithfulness, relevancy, and accuracy.
- MockExam: A test module for pitting the chatbot against a combination of Pepwave-authored mock exam questions and the real Pepwave Certified Engineer Exam.
- NLP utilities for tokenization, deduplication, and similarity scoring.
- Centralized prompt loading and management for reproducible prompt engineering.
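To make the deduplication step in BaseLoad concrete, here is a minimal, generic sketch (not this repo's implementation) that pairs MinHash LSH from datasketch for cheap candidate generation with RapidFuzz for confirmation. The shingling scheme and both thresholds are illustrative placeholders.

```python
# Generic near-duplicate detection sketch, NOT the repo's BaseLoad logic.
# MinHash LSH proposes candidate pairs cheaply; RapidFuzz confirms them.
from datasketch import MinHash, MinHashLSH
from rapidfuzz import fuzz


def signature(text: str, num_perm: int = 128) -> MinHash:
    """Build a MinHash signature from whitespace tokens (placeholder shingling)."""
    m = MinHash(num_perm=num_perm)
    for token in text.lower().split():
        m.update(token.encode("utf-8"))
    return m


def deduplicate(docs: dict[str, str]) -> list[str]:
    """Return the ids of documents to keep, dropping near-duplicates of earlier docs."""
    lsh = MinHashLSH(threshold=0.8, num_perm=128)
    kept: list[str] = []
    for doc_id, text in docs.items():
        sig = signature(text)
        candidates = lsh.query(sig)  # approximate-Jaccard candidate ids
        if not any(fuzz.token_set_ratio(text, docs[c]) >= 90 for c in candidates):
            lsh.insert(doc_id, sig)
            kept.append(doc_id)
    return kept
```

The real pipeline tunes these choices per dataset and also uses NLTK, as noted in the list above.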
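Below is a minimal retrieve-then-generate LangGraph sketch in the spirit of RagInferenceLangGraph. It is not the project's implementation: the index name, model names, prompt, and state fields are assumptions, and the real graph adds agentic routing and chat history.

```python
# Minimal retrieve -> generate LangGraph sketch; names below are placeholders.
from typing import TypedDict

from langchain_openai import ChatOpenAI, OpenAIEmbeddings
from langchain_pinecone import PineconeVectorStore
from langgraph.graph import END, START, StateGraph


class ChatState(TypedDict):
    question: str
    context: str
    answer: str


vector_store = PineconeVectorStore(
    index_name="pepwave-docs",  # hypothetical index name
    embedding=OpenAIEmbeddings(model="text-embedding-3-small"),
)
llm = ChatOpenAI(model="gpt-4o-mini")


def retrieve(state: ChatState) -> dict:
    docs = vector_store.similarity_search(state["question"], k=5)
    return {"context": "\n\n".join(doc.page_content for doc in docs)}


def generate(state: ChatState) -> dict:
    prompt = (
        "Answer the Pepwave question using only the context below.\n\n"
        f"Context:\n{state['context']}\n\nQuestion: {state['question']}"
    )
    return {"answer": llm.invoke(prompt).content}


builder = StateGraph(ChatState)
builder.add_node("retrieve", retrieve)
builder.add_node("generate", generate)
builder.add_edge(START, "retrieve")
builder.add_edge("retrieve", "generate")
builder.add_edge("generate", END)
graph = builder.compile()

result = graph.invoke({"question": "How does SpeedFusion bonding work?"})
print(result["answer"])
```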
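And a sketch of the markdown-based prompt loading idea; the actual helper and file names in this repo may differ.

```python
# Hypothetical prompt loader: prompts live as versioned markdown files under /prompts.
from pathlib import Path

PROMPTS_DIR = Path("prompts")  # assumed location of the markdown prompt files


def load_prompt(name: str) -> str:
    """Return the raw markdown for a named prompt, e.g. load_prompt("rag_system")."""
    return (PROMPTS_DIR / f"{name}.md").read_text(encoding="utf-8")
```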
- LangChain, LangGraph (RAG/agentic workflows)
- OpenAI API
- Pinecone (vector store)
- spaCy, NLTK, datasketch, RapidFuzz (NLP & deduplication)
- Pandas, NumPy, SciPy, scikit-learn, Matplotlib, Hugging Face Transformers (data processing)
- RAGAS (evaluation)
- Pydantic (validation)
- Extract: Run extractors to collect raw data into `data/<source>/raw/`.
- Transform: Run transformers to normalize and serialize documents to `data/<source>/documents/`.
- Load: Run loaders to deduplicate, enrich, and embed documents and then upload to the vector store.
- RAG Inference: Run `RagInferenceLangGraph` for conversational QA (see `inference/rag_inference_langgraph.py`).
- Evaluation: Generate a knowledge graph and testsets, then run RAGAS-based and MockExam evaluation using scripts in `evals/`.
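The stage order can be made concrete with a hypothetical driver script. Only `inference/rag_inference_langgraph.py` is a real path from this README; the other module paths, class names, and the `.run()`/`.chat()` methods are placeholders.

```python
# Hypothetical end-to-end run illustrating the stage order; actual entry points differ.
from extract.reddit_extractor import RedditExtractor        # placeholder module path
from transform.reddit_transform import RedditTransform      # placeholder module path
from load.reddit_load import RedditLoad                     # placeholder module path
from inference.rag_inference_langgraph import RagInferenceLangGraph

RedditExtractor().run()    # -> data/reddit/raw/*.jsonl
RedditTransform().run()    # -> data/reddit/documents/*.parquet
RedditLoad().run()         # dedupe, enrich, embed, upload to Pinecone

rag = RagInferenceLangGraph()
print(rag.chat("Why does my cellular WAN keep failing over?"))  # placeholder method name
```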
- Evaluation: The evaluation framework is the most complex part of the application. The knowledge graph and testset generation procedures are the product of many iterations and experiments. I was very happy with the quality of the main testset in `evals/testsets/testset-200_main_testset_25-04-23`. I also did thorough testing to ensure that the metrics are consistent and meaningful at a reasonable cost (a minimal RAGAS usage sketch follows this list).
- Reproducibility: All artifacts (raw, transformed, testsets, evaluation outputs) are versioned and stored for traceability.
- Prompt Engineering: Experimented with various prompt engineering techniques. Settled on a prompt management strategy that uses markdown files in `/prompts`, which are easy to read and edit and are versioned with the application, instead of resorting to fancy cloud storage/versioning options. I like the simplicity.
- Best Practices: Type annotations, modular design, and clear separation of concerns throughout.
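For reference, here is what a RAGAS evaluation looks like with the upstream v0.1-style API; this project's customized fork (aubford/ragas) and its RagasEval wrapper may expose different interfaces, and the example rows below are invented.

```python
# Upstream RAGAS (v0.1-style) evaluation sketch; the project's fork may differ.
from datasets import Dataset
from ragas import evaluate
from ragas.metrics import answer_relevancy, context_precision, context_recall, faithfulness

eval_dataset = Dataset.from_dict(
    {
        "question": ["What does SpeedFusion bonding do?"],
        "answer": ["It combines multiple WAN links into a single bonded VPN tunnel."],
        "contexts": [["SpeedFusion bonds several WAN connections into one SD-WAN tunnel."]],
        "ground_truth": ["SpeedFusion aggregates multiple WAN links into one tunnel."],
    }
)

scores = evaluate(
    eval_dataset,
    metrics=[context_recall, context_precision, faithfulness, answer_relevancy],
)
print(scores)  # per-metric averages
```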