💥 Expose agent testing utils #1164

donald-pinckney · 2025-10-15T17:44:34Z

What was changed

Adds a new temporalio.contrib.openai_agents.test module, for keeping utilities to assist in writing tests of agents.
Moves TestModel and TestModelProvider from temporalio.contrib.openai_agents to temporalio.contrib.openai_agents.test (this is a breaking change 💥)
Publicly exposes StaticTestModel and ResponseBuilders in temporalio.contrib.openai_agents.test

Why?

Writing tests of agentic code requires boilerplate setup of model mocks. This is a first attempt to make this easier for users.

Checklist

How was this tested: updated existing unit tests to use the temporalio.contrib.openai_agents.test module.
Any docs updates needed? No, there are currently no docs for testing utils. Let's add in later PRs after building samples.

CLAassistant · 2025-10-15T17:44:42Z

All committers have signed the CLA.

tests/contrib/openai_agents/test_openai.py

tconley1428 · 2025-10-17T16:19:58Z

temporalio/contrib/openai_agents/testing.py

+        raise NotImplementedError()
+
+
+class StaticTestModel(TestModel):


I liked the idea of changing this to a static factory on testmodel

Just pushed the static factory method.

temporalio/contrib/openai_agents/testing.py

cretz · 2025-10-17T16:31:11Z

temporalio/contrib/openai_agents/testing.py

A bit hard to see from this PR how this looks from a user POV. One reason we did "ActivityEnvironment" and "WorkflowEnvironment" instead of only the building blocks is because users like the nice simplicity of one-liners and reusable constructs. I'm wondering if there's an opportunity to design something here. If not too much trouble, can I see what tests/openai_agents/basic/test_hello_world_workflow.py will look like using these utilities?

Part of me wonders if we can have an AgentEnvironment that basically accepts everything the plugin accepts and also some of this mock stuff. So maybe something like:

from temporalio.contrib.openai_agents.testing import AgentEnvironment # ... async def test_hello_world_agent_workflow(client: Client): async def on_model_call(req: WhateverOpenAIRequestType) -> WhateverOpenAIResponseType: # Do some stuff # on_model_call is just an advanced example, accepting direct mocks can # in this constructor be allowed too async with AgentEnvironment(on_model_call=on_model_call) as env: # Applies plugin and such (which is also available on env.plugin if you want it) client = env.applied_on_client(client) # Rest of the stuff w/ worker and such

Currently (with the change to static factory method I just pushed), that test would look like:

@pytest.fixture def test_model(): return TestModel.returning_responses( [ResponseBuilders.output_message("This is a haiku (not really)")] ) async def test_execute_workflow(client: Client): task_queue_name = str(uuid.uuid4()) async with Worker( client, task_queue=task_queue_name, workflows=[HelloWorldAgent], activity_executor=ThreadPoolExecutor(5), ): result = await client.execute_workflow( HelloWorldAgent.run, "Write a recursive haiku about recursive haikus.", id=str(uuid.uuid4()), task_queue=task_queue_name, ) assert isinstance(result, str) assert len(result) > 0

client is a fixture that depends on the test_model fixture, so you can override the test_model fixture per test or per module.

I think for most users this is missing the client and plugin configuration which I think we should make easy for testers too. I think to show the full code to compare, you'd have to include your other fixtures like client configuration and plugin creation. Those fixtures are a little pytest specific and external to the test and not really have we have done test helpers in the past. I guess I was thinking something you could easily configure inside your test for each test (but still share if you want). Basically you need an easy way to configure an existing client with the plugin and model stuff.

donald-pinckney changed the title ~~[WIP] Expose agent testing utils to users~~ [WIP] Expose agent testing utils Oct 15, 2025

donald-pinckney commented Oct 15, 2025

View reviewed changes

tests/contrib/openai_agents/test_openai.py Outdated Show resolved Hide resolved

donald-pinckney changed the title ~~[WIP] Expose agent testing utils~~ Expose agent testing utils Oct 15, 2025

donald-pinckney marked this pull request as ready for review October 15, 2025 19:37

donald-pinckney requested a review from a team as a code owner October 15, 2025 19:37

donald-pinckney changed the title ~~Expose agent testing utils~~ 💥 Expose agent testing utils Oct 15, 2025

tconley1428 reviewed Oct 17, 2025

View reviewed changes

temporalio/contrib/openai_agents/testing.py Outdated Show resolved Hide resolved

cretz reviewed Oct 17, 2025

View reviewed changes

donald-pinckney added 12 commits October 17, 2025 15:13

Expose some agent testing utils to users

28d9283

rename test -> testing

ab762c4

update import sites

ed44c49

fix lints

30b6fbc

cleanup diff

73774a0

fmt

7a9a8d1

Add experimental warnings to docs

569fc64

Change to factory method

d249066

fmt

3f0908d

cleanup

b72dd77

fmt

296cc3c

lints

d3010d2

donald-pinckney force-pushed the d/20251015-125526 branch from a31afee to d3010d2 Compare October 17, 2025 19:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

💥 Expose agent testing utils #1164

💥 Expose agent testing utils #1164

Uh oh!

donald-pinckney commented Oct 15, 2025 •

edited

Loading

Uh oh!

CLAassistant commented Oct 15, 2025 •

edited

Loading

Uh oh!

Uh oh!

tconley1428 Oct 17, 2025

Uh oh!

donald-pinckney Oct 17, 2025

Uh oh!

Uh oh!

cretz Oct 17, 2025 •

edited

Loading

Uh oh!

donald-pinckney Oct 17, 2025 •

edited

Loading

Uh oh!

cretz Oct 17, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

		raise NotImplementedError()


		class StaticTestModel(TestModel):

💥 Expose agent testing utils #1164

Are you sure you want to change the base?

💥 Expose agent testing utils #1164

Uh oh!

Conversation

donald-pinckney commented Oct 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What was changed

Why?

Checklist

Uh oh!

CLAassistant commented Oct 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

tconley1428 Oct 17, 2025

Choose a reason for hiding this comment

Uh oh!

donald-pinckney Oct 17, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

cretz Oct 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

donald-pinckney Oct 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cretz Oct 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

donald-pinckney commented Oct 15, 2025 •

edited

Loading

CLAassistant commented Oct 15, 2025 •

edited

Loading

cretz Oct 17, 2025 •

edited

Loading

donald-pinckney Oct 17, 2025 •

edited

Loading

cretz Oct 17, 2025 •

edited

Loading