Python: Emit token usage with streaming chat completion agent. #12416
Motivation and Context
The chat completion agent was not emitting token usage during streaming invocation because we only let messages through when `response.items` was non-empty. For token-usage chunks, `response.items` is `[]` and the usage is carried in the message's `metadata` dict. This PR fixes that bug by letting a message through when `response.items or response.metadata.get("usage")` is truthy. Two new samples are added to the concepts/agents/chat_completion dir to show how to track token usage for streaming and non-streaming agent invocation (a general sketch of the pattern follows below). Token usage handling is also added to the chat completion agent integration tests.
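A minimal sketch of consuming the streamed usage from the caller's side, assuming an OpenAI chat completion service, the `invoke_stream(messages=...)` entry point, a `.message` attribute on each streamed item, and `prompt_tokens`/`completion_tokens` fields on the usage object; this is not one of the samples added by the PR.

```python
# Sketch (assumptions noted above): accumulate token usage while streaming
# from a ChatCompletionAgent.
import asyncio

from semantic_kernel.agents import ChatCompletionAgent
from semantic_kernel.connectors.ai.open_ai import OpenAIChatCompletion


async def main() -> None:
    agent = ChatCompletionAgent(
        service=OpenAIChatCompletion(),
        name="Assistant",
        instructions="Answer questions concisely.",
    )

    prompt_tokens = 0
    completion_tokens = 0

    async for response in agent.invoke_stream(messages="Why is the sky blue?"):
        message = response.message
        # Regular chunks carry content; the usage chunk has no items but
        # exposes a usage object in the message metadata.
        if message.content:
            print(message.content, end="", flush=True)
        usage = (message.metadata or {}).get("usage")
        if usage:
            prompt_tokens += usage.prompt_tokens
            completion_tokens += usage.completion_tokens

    print(f"\nPrompt tokens: {prompt_tokens}, completion tokens: {completion_tokens}")


if __name__ == "__main__":
    asyncio.run(main())
```

Non-streaming invocation can read the same `metadata["usage"]` entry from the returned message.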
Description

This PR also adds handling for the `prompt_tokens_details` and `completion_tokens_details` models that are returned but were not previously handled.

Contribution Checklist