
Commit dc1e98a

Document the use of the backends
1 parent 71b8037 commit dc1e98a

File tree

3 files changed (+63, -0 lines changed)

docs/features/advanced/backends.md

Lines changed: 56 additions & 0 deletions
@@ -0,0 +1,56 @@
---
title: Structured Generation Backends
---

# Structured Generation Backends

Outlines relies on a structured generation backend to control text generation for steerable models so that it conforms to the output type provided. One of those backends is of course `outlines-core`, but you also have access to two other libraries that fulfill the same purpose: `llguidance` and `xgrammar`.
## Overview

To select the backend to use for your generation, provide a value for the `backend` argument when calling a model or a generator.

For instance:

```python
from typing import Literal

import outlines
from transformers import AutoModelForCausalLM, AutoTokenizer

output_type = Literal["Paris", "London", "Rome", "Berlin"]

model = outlines.from_transformers(
    AutoModelForCausalLM.from_pretrained("microsoft/Phi-3-mini-4k-instruct"),
    AutoTokenizer.from_pretrained("microsoft/Phi-3-mini-4k-instruct")
)

result = model("What is the capital of France?", output_type, backend="llguidance")
print(result) # 'Paris'

generator = outlines.Generator(model, output_type)
result = generator("What is the capital of France?", backend="xgrammar")
print(result) # 'Paris'
```
If you do not provide a value for the `backend` argument, a default backend is used. The default depends on the output type (see the sketch after this list):

- JSON schema: `outlines_core`
- Regex: `outlines_core`
- Context-free grammar: `llguidance`
- Interegular FSM: `outlines_core`
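As a minimal sketch of the default behaviour, the example below reuses the `model` built in the overview example and assumes a regex output type is available as `outlines.types.Regex` in your version of Outlines:

```python
from outlines.types import Regex  # regex output type; assumed importable from outlines.types

# A regex describing a five-digit zip code, purely for illustration.
zip_code = Regex(r"\d{5}")

# No `backend` argument: regex output types use the default backend, `outlines_core`.
result = model("Give me a US zip code.", zip_code)
print(result)  # e.g. '75001'

# The same call with the backend selected explicitly.
result = model("Give me a US zip code.", zip_code, backend="xgrammar")
```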
## Features matrix

As mentioned previously, selecting the structured generation backend only applies to steerable models, that is `Transformers`, `LlamaCpp` and `MLXLM`. Additionally, some backends do not support all of those models or all output types, as summarized in the matrix below; a short example follows it.
| | outlines_core | llguidance | xgrammar |
|---|---|---|---|
| **Models** | | | |
| Transformers | | | |
| LlamaCpp | | | |
| MLXLM | | | |
| **Output Types** | | | |
| JSON Schema | | | |
| Regex | | | |
| Grammar | | | |
| FSM | | | |
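As a rough illustration of how the matrix is used in practice, here is a sketch that reuses the `model` from the overview example and assumes a Pydantic model is accepted as a JSON schema output type:

```python
from pydantic import BaseModel

import outlines


class City(BaseModel):
    name: str
    country: str


generator = outlines.Generator(model, City)

# JSON schema output types use the default backend, `outlines_core`.
result = generator("Describe a European capital as JSON.")

# Any backend that supports JSON schemas can be selected explicitly instead.
result = generator("Describe a European capital as JSON.", backend="llguidance")
print(result)  # e.g. '{"name": "Paris", "country": "France"}'
```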

mkdocs.yml

Lines changed: 1 addition & 0 deletions

@@ -161,6 +161,7 @@ nav:

   - Advanced:
     - Logits Processors: features/advanced/logits_processors.md
+    - Structured Generation Backends: features/advanced/backends.md

   - API Reference: api_reference/
pyproject.toml

Lines changed: 6 additions & 0 deletions

@@ -58,6 +58,8 @@ sglang = ["openai"]
 tgi = ["huggingface_hub"]
 transformers = ["accelerate", "datasets", "transformers"]
 vllm = ["openai"]
+xgrammar = ["xgrammar"]
+llguidance = ["llguidance"]
 test = [
     "pre-commit",
     "pytest",
@@ -89,6 +91,8 @@ test = [
     "dottxt",
     "sentencepiece",
     "mkdocs_gen_files",
+    "llguidance",
+    "xgrammar",
 ]
 test-gpu=["outlines[test]", "vllm; sys_platform == 'linux'"]

@@ -170,6 +174,8 @@ module = [
     "tf-keras.*",
     "mkdocs_gen_files.*",
     "jsonpath_ng.*",
+    "llguidance.*",
+    "xgrammar.*",
 ]
 ignore_missing_imports = true
