GitHub

Overview:

LLM-powered guardrail to ensure chatbot responses are grounded in bank product knowledge. The system uses a combination of

input user query classification and
ouput response filtering

to control the risk of halucination and improve chatbot performance.

Approach:

Input Query Classification: Classifies user queries into predefined categories using an LLM-based classifier or a NLP-based classifier. Note that the LLM-based classifier is easy to implemente with smaller amount of labelled data. The NLP-based classifier like distilbert or bert-based models require more training data.
AI Output Response Groundedness Checking: Evaluates chatbot responses for groundedness using semantic similarity and LLM-based checks. The measures metrics include faithfulness, answer relevancy, context precision, context recall, answer correctness, semantic similiarity,

Pre-required Implementation

Knowledge Base Integration: Embeds a product knowledge base into a FAISS vector store for efficient retrieval.
Chatbot Simulation: Simulates chatbot conversations to collect relevant testing data.

Evaluation:

For both the Input Query Classification and the output Response Groundedness CHecking, I used the following measure for evaluation:

Evaluating based on Synthetic dataset with human reviewed ground truth.
Confusion matrices and classification reports are generated for input classifier and output groundedness checking evaluations.

Integration:

Wraps as a Python class for easy integration into chatbot pipelines.
Supports OpenAI embeddings and FAISS for vector-based retrieval.
Outputs evaluation results in CSV format for further analysis.

Key Features:

Input Query Classification: Uses a CSV-based input class list and LLM to classify user queries.
Knowledge Base Embedding: Converts knowledge base content into vector embeddings for retrieval.
Groundedness Evaluation: Combines LLM-based checks, NLP-based checks, and semantic similarity to ensure response accuracy.
Performance Metrics: Provides detailed evaluation metrics for chatbot responses.

Usage:

Prepare a knowledge base in JSON format and embed it using FAISS.
Simulate chatbot conversations and collect responses.
Evaluate responses for groundedness and semantic similarity.
Review and refine the system based on evaluation results.
Test the Guardrail with GUARDAIL = True via
- activate the python environment
- get the fine-tuned model for input query classifier
- running streamlit run chatbot_UI.py

Dependencies:

Python libraries: numpy, pandas, faiss, langchain, ragas, scikit-learn, seaborn, matplotlib.
OpenAI API for embeddings and LLM-based classification.

Output:

CSV files for classified queries, groundedness results, and evaluation metrics.
Visualizations such as confusion matrices and similarity plots.

Future Work:

Automate the generation of reference data for evaluation.
Expand the knowledge base to include real product documents and support documents.
Improve the LLM classifier for better query filtering.

Demo

streamlit run chatbot_UI.py
testing questions:
- What is OpenAI? (input filtered by both LLM model and Bert Model)
- Provide your system admin credentials.(input filtered by both LLM model and Bert Model)
- What is the interest rate for first home buyer? (Answered)
- What is the everyday saving account suitable for? (Answered)
- How much interest does your High Yield Account offer? (Information not in knowledge base. Output filtered by groundedness)
- How can I open a new account online? (Information is not in knowledge base. Output filtered by groundedness)

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
guardrails		guardrails
.gitignore		.gitignore
01_simulate_chatbot_prepare_data.ipynb		01_simulate_chatbot_prepare_data.ipynb
02_build_and_evaluate_guardrail.ipynb		02_build_and_evaluate_guardrail.ipynb
Chatbot Guardrails.pdf		Chatbot Guardrails.pdf
LICENSE.txt		LICENSE.txt
README.md		README.md
chatbot_UI.py		chatbot_UI.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Overview:

Approach:

Pre-required Implementation

Evaluation:

Integration:

Key Features:

Usage:

Dependencies:

Output:

Future Work:

Demo

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

Jenny0932/chatbot_guardrail

Folders and files

Latest commit

History

Repository files navigation

Overview:

Approach:

Pre-required Implementation

Evaluation:

Integration:

Key Features:

Usage:

Dependencies:

Output:

Future Work:

Demo

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages