Initial Checks
- I confirm that I'm using the latest version of Pydantic AI
- I confirm that I searched for my issue in https://github.com/pydantic/pydantic-ai/issues before opening this issue
Description
There may be a bug in how cached_tokens are handled in PydanticAI when using Langfuse.
According to the Langfuse documentation:
Usage types can be arbitrary strings and differ by LLM provider. At the highest level, they can simply be input and output. As LLMs grow more sophisticated, additional usage types are necessary, such as cached_tokens, audio_tokens, and image_tokens.
In the UI, Langfuse summarizes all usage types that include the string input as input usage types, and similarly those including output as output usage types. If no total usage type is ingested, Langfuse sums up all usage type units to compute the total.
However, PydanticAI currently logs cached tokens under the attribute gen_ai.usage.details.cached_tokens. Because that name contains neither "input" nor "output", Langfuse treats it as an "other" usage type rather than grouping it under input. As a result, the total usage and cost calculations in Langfuse may be incorrect.

As the screenshot shows, cached_tokens is counted as part of total usage rather than being grouped under input.
I believe this behavior should be adjusted in PydanticAI, perhaps conditionally when Langfuse (rather than Logfire) is used as the tracing backend, to ensure compatibility.
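In the meantime, a possible user-side workaround is to rename the attribute before export so that its name contains "input", which Langfuse then groups under input usage. Below is a minimal sketch, not PydanticAI's API: the renamed key input_cached_tokens is my own choice rather than an established convention, and overwriting ReadableSpan attributes relies on OpenTelemetry SDK internals.

```python
from opentelemetry.sdk.trace.export import SpanExporter, SpanExportResult

OLD_KEY = "gen_ai.usage.details.cached_tokens"
# Assumption: any usage key containing "input" is summed as input usage by Langfuse.
NEW_KEY = "gen_ai.usage.details.input_cached_tokens"


class CachedTokenRenamingExporter(SpanExporter):
    """Wraps another exporter and renames the cached-token attribute on the way out."""

    def __init__(self, wrapped: SpanExporter):
        self._wrapped = wrapped

    def export(self, spans) -> SpanExportResult:
        for span in spans:
            attrs = dict(span.attributes or {})
            if OLD_KEY in attrs:
                attrs[NEW_KEY] = attrs.pop(OLD_KEY)
                # ReadableSpan attributes are not officially mutable; replacing the
                # internal mapping is a hack that works with the current SDK.
                span._attributes = attrs
        return self._wrapped.export(spans)

    def shutdown(self) -> None:
        self._wrapped.shutdown()

    def force_flush(self, timeout_millis: int = 30_000) -> bool:
        return self._wrapped.force_flush(timeout_millis)
```

One would then wire this wrapper in place of the plain OTLP exporter, e.g. via a BatchSpanProcessor passed to logfire.configure(additional_span_processors=...).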
Example Code
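A minimal sketch of a reproduction, assuming the standard Langfuse OTLP setup; the credentials, model, and prompt below are placeholders:

```python
import base64
import os

# Assumption: Langfuse Cloud's OTLP endpoint with Basic auth from the project keys.
LANGFUSE_AUTH = base64.b64encode(b"pk-lf-...:sk-lf-...").decode()
os.environ["OTEL_EXPORTER_OTLP_ENDPOINT"] = "https://cloud.langfuse.com/api/public/otel"
os.environ["OTEL_EXPORTER_OTLP_HEADERS"] = f"Authorization=Basic {LANGFUSE_AUTH}"

import logfire
from pydantic_ai import Agent

# Export spans via OTLP to Langfuse instead of sending them to Logfire.
logfire.configure(send_to_logfire=False)

agent = Agent("openai:gpt-4o", instrument=True)

# Run the same long prompt twice so the second call hits the provider's prompt
# cache and the response reports cached tokens.
long_prompt = "<prompt long enough to trigger provider-side prompt caching>"
for _ in range(2):
    result = agent.run_sync(long_prompt)
    print(result.usage())

# In Langfuse, the second generation shows gen_ai.usage.details.cached_tokens
# as an "other" usage type instead of being grouped under input.
```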
Python, Pydantic AI & LLM client version
python: 3.13
"logfire[httpx]>=3.17.0",
"pydantic-ai-slim[mcp,openai]>=0.3.5",
"pydantic-ai[logfire]>=0.3.5",