Toolsets #2024

DouweM · 2025-06-19T00:09:22Z

Closes Toolset, take 3 #1973

To fix:

MCPToolset
Retry counting
Streaming tool calls

To improve:

Name conflict error messages and logic
Performance, caching when possible
Considering making MCPServer a toolset, instead of needing MCPServer.as_toolset()
Various other TODO comments
How output tools are handled
DeferredToolCalls with run_stream

To add:

Pass toolsets to run/iter etc. Should override toolsets passed to Agent, but not output tools, @agent.tools tools, Agent(tools=...) tools, or Agent(mcp_servers=...) tools. If you want those to be overridden, you should pass them as Agent(toolsets=FunctionToolset(...))
LangChainToolset(tools=...)
ACIToolset?
Deferred tools that need to be be resolved externally
- Toolset, take 3 #1973 (comment)
- Add to_chat_completions method #2041
Agent(prepare_output_tools=...): Closes Make output_type be conditionally available #2042
Docs
New tests

# Conflicts: # tests/models/test_openai.py

# Conflicts: # pydantic_ai_slim/pydantic_ai/_output.py # pydantic_ai_slim/pydantic_ai/models/openai.py # tests/models/test_openai.py

….)', less 'output_schema.mode == ...'

# Conflicts: # pydantic_ai_slim/pydantic_ai/models/openai.py # pydantic_ai_slim/pydantic_ai/profiles/openai.py # tests/models/test_google.py # tests/models/test_openai_responses.py

# Conflicts: # pydantic_ai_slim/pydantic_ai/_utils.py # pydantic_ai_slim/pydantic_ai/agent.py # tests/models/test_anthropic.py # tests/test_utils.py

…tput

pydantic_ai_slim/pydantic_ai/toolsets/mapped.py

tests/test_streaming.py

tests/test_agent.py

pydantic_ai_slim/pydantic_ai/toolsets/run.py

pydantic_ai_slim/pydantic_ai/toolsets/processed.py

pydantic_ai_slim/pydantic_ai/toolsets/prefixed.py

pydantic_ai_slim/pydantic_ai/toolsets/__init__.py

# Conflicts: # pydantic_ai_slim/pydantic_ai/agent.py # pydantic_ai_slim/pydantic_ai/mcp.py

…pass sampling_model to MCPServer through RunContext, and make Agent an async contextmanager instead of run_toolsets

… being stored on read broke a test

…gent.set_mcp_sampling_model

…egy stops later tools from being called

# Conflicts: # pydantic_ai_slim/pydantic_ai/agent.py # pydantic_ai_slim/pydantic_ai/mcp.py # tests/test_examples.py

Viicos · 2025-07-07T14:02:08Z

pydantic_ai_slim/pydantic_ai/tools.py

@@ -464,4 +362,12 @@ class ToolDefinition:
    Note: this is currently only supported by OpenAI models.
    """

+    kind: ToolKind = field(default='function')
+    """The kind of tool:


Suggested change

"""The kind of tool:

"""The kind of tool:

Otherwise list doesn't display properly.

Viicos · 2025-07-07T14:05:14Z

docs/mcp/client.md

@@ -47,11 +47,11 @@ from pydantic_ai import Agent
 from pydantic_ai.mcp import MCPServerSSE

 server = MCPServerSSE(url='http://localhost:3001/sse')  # (1)!
-agent = Agent('openai:gpt-4o', mcp_servers=[server])  # (2)!
+agent = Agent('openai:gpt-4o', toolsets=[server])  # (2)!


Just for my naive understanding, mcp servers can provide more concepts than just tools (e.g. resources). I don't know if PydanticAI concepts exists for these other concepts; if it is the case (or will be in the future), how are we going to make use of them?

@Viicos That's a good point. Curious what Samuel etc think.

For MCP resources, I think we're OK because "Resources are designed to be application-controlled, meaning that the client application can decide how and when they should be used." In the case of Pydantic AI, that would mean the user code has to explicitly do things like server.list_resources() and server.read_resource(...) (possibly from a tool or dynamic instructions function), and there's no automatic resources-related behavior they'd get just from registering the MCP server with the agent. So if they don't want to register it as a toolset, they can still use it directly. They would be recommended to enter its context manually to ensure the server is running, instead of getting this automatically from entering the agent context, but this PR also has the MCP server context be entered automatically when it's needed (to prepare for Temporal where every "activity" -- anything that does IO, like a tool call -- runs in isolated context and they can't share a connection anyway).

For MCP prompts, I'd expect the user to similarly explicitly need to call server.get_prompt(...) from e.g. a dynamic instructions function (or just when building the agent).

I'd expect other future MCP features to also not have the same "auto-use" dynamic that MCP tools do.

So I'd interpret this less as "an MCP server is just a toolset now", and more "an MCP server can be used directly as a toolset, and other things". The only thing users would lack if they don't register it as a toolset would be automatic entering of the context when the agent context is entered, which I think is acceptable.

Viicos · 2025-07-07T14:12:53Z

docs/mcp/client.md



 async def main():
-    async with agent.run_mcp_servers():  # (3)!
+    async with agent:  # (3)!


What was the motivation to make the context manager "implicit" here? Doesn't it feels weird to have Agent.__aenter__() reserved for MCP logic?

We also intend to use it for other things, like httpx clients: #1695 (comment)

Viicos · 2025-07-07T14:20:04Z

pydantic_ai_slim/pydantic_ai/_agent_graph.py

    """
+    span_attributes = {


An idea that might be worth exploring in the future: I'm wondering if attributes could be lazily computed if a NoOpTracer is used (i.e. you don't have instrumentation set). Most likely the runtime overhead isn't too big, but would be interesting to investigate.

Viicos · 2025-07-07T14:21:15Z

pydantic_ai_slim/pydantic_ai/toolsets/deferred.py

+
+
+class DeferredToolset(AbstractToolset[AgentDepsT]):
+    """A toolset that holds deferred tool."""


Suggested change

"""A toolset that holds deferred tool."""

"""A toolset that holds deferred tools."""

Viicos · 2025-07-07T14:25:21Z

pydantic_ai_slim/pydantic_ai/toolsets/__init__.py

+        return self.__class__.__name__.replace('Toolset', ' toolset')
+
+    @property
+    def tool_name_conflict_hint(self) -> str:


Maybe have this property private

Viicos · 2025-07-07T14:26:34Z

pydantic_ai_slim/pydantic_ai/toolsets/_callable.py

+    async def call_tool(self, call: ToolCallPart, ctx: RunContext[AgentDepsT], allow_partial: bool = False) -> Any:
+        ctx = replace(ctx, tool_name=call.tool_name, tool_call_id=call.tool_call_id)
+
+        pyd_allow_partial: Literal['off', 'trailing-strings'] = 'trailing-strings' if allow_partial else 'off'


pyright should infer the type to a literal already

Suggested change

pyd_allow_partial: Literal['off', 'trailing-strings'] = 'trailing-strings' if allow_partial else 'off'

pyd_allow_partial = 'trailing-strings' if allow_partial else 'off'

Viicos · 2025-07-07T14:30:00Z

pydantic_ai_slim/pydantic_ai/toolsets/combined.py

+
+    async def __aexit__(self, *args: Any) -> bool | None:
+        self._entered_count -= 1
+        if self._entered_count <= 0 and self._exit_stack is not None:


Logically it should be < 0?

Suggested change

if self._entered_count <= 0 and self._exit_stack is not None:

if self._entered_count == 0 and self._exit_stack is not None:

Copied from MCPServer's implementation, but agreed :)

DouweM added 30 commits June 3, 2025 03:20

WIP: Output modes

e290951

WIP: More output modes

2056539

Merge remote-tracking branch 'origin/main' into output-modes

bceba19

# Conflicts: # tests/models/test_openai.py

Fix tests

0cb25c4

Remove syntax invalid before Python 3.12

933b74e

Fix tests

7974df0

Add TextOutput marker

9cc19e2

Merge remote-tracking branch 'origin/main' into output-modes

bc6bb65

# Conflicts: # pydantic_ai_slim/pydantic_ai/_output.py # pydantic_ai_slim/pydantic_ai/models/openai.py # tests/models/test_openai.py

Add VCR recording of new test

0e356a3

Implement additional output modes in GeminiModel and GoogleModel

81312dc

Fix prompted_json on OpenAIResponses

52ef4d5

Test output modes on Gemini and Anthropic

fe05956

Add VCR recordings of Gemini output mode tests

94421f3

Remove some old TODO comments

1902d00

Add missing VCR recording of Gemini output mode test

1f53c9b

Add more missing VCR recordings

a4c2877

Fix OpenAI tools

56e58f9

Improve test coverage

a5234e1

Update unsupported output mode error message

40def08

Improve test coverage

837d305

Merge branch 'main' into output-modes

3598bef

Test streaming with structured text output

5f71ba8

Make TextOutputFunction Python 3.9 compatible

cfc2749

Properly merge JSON schemas accounting for defs

a137641

Refactor output schemas and modes: more 'isinstance(output_schema, ..…

f495d46

….)', less 'output_schema.mode == ...'

Merge branch 'main' into output-modes

449ed0d

# Conflicts: # pydantic_ai_slim/pydantic_ai/models/openai.py # pydantic_ai_slim/pydantic_ai/profiles/openai.py # tests/models/test_google.py # tests/models/test_openai_responses.py

Clean up some variable names

e70d249

Improve test coverage

4592b0b

Merge branch 'main' into output-modes

db1c628

# Conflicts: # pydantic_ai_slim/pydantic_ai/_utils.py # pydantic_ai_slim/pydantic_ai/agent.py # tests/models/test_anthropic.py # tests/test_utils.py

Combine JsonSchemaOutput and PromptedJsonOutput into StructuredTextOu…

f57d078

…tput