An automated testing framework for Large Language Models (LLMs), built on DeepEval.
This project helps you:
- Define structured test cases for LLMs in Python
- Evaluate model responses using a local embedding-based metric
- Run tests via pytest with optional response caching
- Get detailed Allure reports for each test case
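As a taste of the test-case format, here is a minimal sketch using DeepEval's `LLMTestCase`. The field values are illustrative only; in this suite, `actual_output` is produced by the model under test (live or cached) rather than hard-coded:

```python
from deepeval.test_case import LLMTestCase

# Illustrative case only; in the real suite the actual_output comes
# from the model under test, not a literal string.
case = LLMTestCase(
    input="What is the capital of France?",
    actual_output="The capital of France is Paris.",
    expected_output="Paris",
)
```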
- ✅ LLM test cases defined in Python
- ✅ Local embedding similarity metric (offline; sketched below)
- ✅ OpenAI API support (configurable model)
- ✅ Fallback to Hugging Face models if no API key is set
- ✅ Response caching for faster repeat runs
- ✅ Parametrized pytest suite with Allure integration
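The offline metric compares the model's answer to the expected answer in embedding space. Below is a minimal sketch of the idea, assuming `sentence-transformers` and the `all-MiniLM-L6-v2` model; the model name, helper names, and 0.8 threshold are assumptions, not necessarily what `llm_guard/metrics/local_embedding.py` uses:

```python
from sentence_transformers import SentenceTransformer, util

# Runs fully offline once the model has been downloaded.
_model = SentenceTransformer("all-MiniLM-L6-v2")

def embedding_similarity(actual: str, expected: str) -> float:
    """Cosine similarity of the two texts in embedding space."""
    vectors = _model.encode([actual, expected], convert_to_tensor=True)
    return util.cos_sim(vectors[0], vectors[1]).item()

def passes(actual: str, expected: str, threshold: float = 0.8) -> bool:
    # Hypothetical pass/fail rule; the real threshold is project-specific.
    return embedding_similarity(actual, expected) >= threshold
```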
```bash
git clone https://github.com/your-username/qa-llm-guard.git
cd qa-llm-guard
python -m venv venv
source venv/bin/activate         # or .\venv\Scripts\activate on Windows
pip install -r requirements.txt
cp .env.example .env             # then fill in your OPENAI_API_KEY and OPENAI_MODEL if needed
```
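The resulting `.env` holds the two variables named above; something like the following, where the model name is only an example and leaving the key unset triggers the Hugging Face fallback:

```
# Leave unset (or delete) to fall back to a local Hugging Face model
OPENAI_API_KEY=your-openai-api-key
# Example value only; use any chat model your key can access
OPENAI_MODEL=gpt-4o-mini
```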
Execute the full test suite and generate an Allure report:
```bash
pytest --update-cache   # fill or update the response cache
pytest --cached         # run using cached responses
```
- `--cached` — use cached LLM responses
- `--update-cache` — update the cache with new LLM responses
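Both flags are ordinary pytest options, so `tests/conftest.py` registers them via `pytest_addoption`. A sketch of the likely shape (the fixture name and wiring are assumptions):

```python
# tests/conftest.py (sketch)
import pytest

def pytest_addoption(parser):
    parser.addoption("--cached", action="store_true",
                     help="use cached LLM responses instead of calling the model")
    parser.addoption("--update-cache", action="store_true",
                     help="call the model and write responses into the cache")

@pytest.fixture
def cache_flags(request):
    # Hypothetical fixture exposing the two flags to tests.
    return (request.config.getoption("--cached"),
            request.config.getoption("--update-cache"))
```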
Allure results are written to allure-results/. To view the report locally, run:
```bash
allure serve allure-results
```
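The `allure-results/` directory is allure-pytest's output; if `pytest.ini` doesn't already set it (an assumption), you can pass the standard flag yourself:

```bash
pytest --cached --alluredir=allure-results
```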
```
qa-llm-guard/
├── llm_guard/                 # Core logic
│   ├── loader.py
│   ├── cache.py
│   ├── models/
│   │   └── openai_model.py
│   └── metrics/
│       └── local_embedding.py
├── test_cases/                # LLMTestCase definitions
│   ├── test_geography.py
│   ├── test_math.py
│   ├── test_programming.py
│   ├── test_history.py
│   └── test_creative.py
├── tests/
│   ├── conftest.py            # pytest options and cache fixture
│   └── test_llm_cases.py      # parametrized test for each case
├── requirements.txt
├── .env.example
├── pytest.ini
└── README.md
```
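To connect the pieces, `tests/test_llm_cases.py` plausibly parametrizes one test over every collected case, roughly like the sketch below; the `load_all_cases` and `get_response` helpers, the metric import, and the threshold are all assumptions:

```python
# tests/test_llm_cases.py (sketch, not the actual implementation)
import allure
import pytest

from llm_guard.loader import load_all_cases   # assumed helper
from llm_guard.cache import get_response      # assumed: cached or live answer
from llm_guard.metrics.local_embedding import embedding_similarity  # assumed helper

@pytest.mark.parametrize("case", load_all_cases(), ids=lambda c: c.input[:40])
def test_llm_case(case):
    with allure.step(f"Prompt: {case.input}"):
        answer = get_response(case.input)
    score = embedding_similarity(answer, case.expected_output)
    allure.attach(f"{score:.3f}", name="similarity score")
    assert score >= 0.8  # hypothetical threshold
```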
MIT License