remove extra usage additions during yielding in gemini agent #1577

Draft: almeidaalajoel wants to merge 2 commits into base: main

Conversation

@almeidaalajoel commented Apr 24, 2025

The Gemini agent is overcounting tokens by adding up the usage reported on each chunk returned from the Gemini client. Each chunk's usage_metadata carries the cumulative usage up to that chunk, not the usage for that chunk alone.

Instead of summing the usage across chunks, we should just track whatever the latest response said the usage was.
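
A minimal sketch of the idea (hypothetical helper name, not the actual pydantic-ai internals): overwrite the tracked usage with each chunk's usage_metadata instead of adding it.

```python
# Hypothetical sketch, not the actual pydantic-ai code: each streamed
# chunk's usage_metadata reports cumulative totals, so overwrite rather
# than sum.
def usage_from_stream(chunks):
    usage = {'prompt_token_count': 0, 'candidates_token_count': 0, 'total_token_count': 0}
    for chunk in chunks:
        meta = chunk.get('usage_metadata') or {}
        for key in usage:
            if key in meta:
                usage[key] = meta[key]  # latest chunk's totals replace the old ones
    return usage
```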

@almeidaalajoel (Author) commented Apr 24, 2025

Hmm, failing tests; this may be a more complex fix. Is the issue on the Gemini side?

Looks like this may not be a perfect fix, but I wanted to highlight the issue.

@almeidaalajoel changed the title from "remove extra usage additions during yielding in gemini client" to "remove extra usage additions during yielding in gemini agent" Apr 24, 2025
@almeidaalajoel (Author) commented Apr 24, 2025

For reference, printing each of the r objects as they come in looks like this for gemini-2.5-pro-exp:

{'candidates': [{'content': {'role': 'model', 'parts': [{'text': '...'}]}, 'index': 0}], 'usage_metadata': {'prompt_token_count': 21, 'candidates_token_count': 625, 'total_token_count': 646}, 'model_version': 'gemini-2.5-pro-preview-03-25'}

{'candidates': [{'content': {'role': 'model', 'parts': [{'text': '...'}]}, 'index': 0}], 'usage_metadata': {'prompt_token_count': 21, 'candidates_token_count': 650, 'total_token_count': 671}, 'model_version': 'gemini-2.5-pro-preview-03-25'}

{'candidates': [{'content': {'role': 'model', 'parts': [{'text': ' ...'}]}, 'index': 0}], 'usage_metadata': {'prompt_token_count': 21, 'candidates_token_count': 676, 'total_token_count': 697}, 'model_version': 'gemini-2.5-pro-preview-03-25'}

Clearly, the prompt_token_count should not be re-added with every chunk that comes in. And you can also see that candidates_token_count reports the cumulative total up to the current chunk, not the count for just the current chunk.
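
Plugging these three chunks into the overwrite sketch above (hypothetical usage_from_stream helper) gives the documented totals, while the current summing approach triples the prompt count:

```python
# Usage data copied from the gemini-2.5-pro chunks above.
chunks = [
    {'usage_metadata': {'prompt_token_count': 21, 'candidates_token_count': 625, 'total_token_count': 646}},
    {'usage_metadata': {'prompt_token_count': 21, 'candidates_token_count': 650, 'total_token_count': 671}},
    {'usage_metadata': {'prompt_token_count': 21, 'candidates_token_count': 676, 'total_token_count': 697}},
]
# Summing re-adds the prompt with every chunk: 21 * 3 = 63 prompt tokens reported.
assert sum(c['usage_metadata']['prompt_token_count'] for c in chunks) == 63
# Overwriting keeps the final cumulative totals: 21 prompt, 676 output, 697 total.
assert usage_from_stream(chunks) == {
    'prompt_token_count': 21, 'candidates_token_count': 676, 'total_token_count': 697}
```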

However, testing with 2.0 and 1.5, it looked like this:

{'candidates': [{'content': {'role': 'model', 'parts': [{'text': ''}]}}], 'usage_metadata': {'prompt_token_count': 21, 'total_token_count': 21}, 'model_version': 'gemini-1.5-flash'}

{'candidates': [{'content': {'role': 'model', 'parts': [{'text': '...'}]}}], 'usage_metadata': {'prompt_token_count': 21, 'total_token_count': 21}, 'model_version': 'gemini-1.5-flash'}

{'candidates': [{'content': {'role': 'model', 'parts': [{'text': '...'}]}}], 'usage_metadata': {'prompt_token_count': 21, 'total_token_count': 21}, 'model_version': 'gemini-1.5-flash'}

{'candidates': [{'content': {'role': 'model', 'parts': [{'text': '...'}]}}], 'usage_metadata': {'prompt_token_count': 21, 'total_token_count': 21}, 'model_version': 'gemini-1.5-flash'}

{'candidates': [{'content': {'role': 'model', 'parts': [{'text': "..."}]}}], 'usage_metadata': {'prompt_token_count': 21, 'total_token_count': 21}, 'model_version': 'gemini-1.5-flash'}

(I removed the text for easier reading, but it was quite clear that the number of tokens in each chunk did not match the reported counts.)

Here the candidates_token_count is not reported at all (though the prompt_token_count would still be off when summed). Not sure if this is an issue with Google reporting different things on different models?
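
Running the same hypothetical overwrite logic over the 1.5-flash chunks shows the remaining gap: when the model never sends candidates_token_count, no client-side bookkeeping can recover it.

```python
# Shape copied from the gemini-1.5-flash chunks above; candidates_token_count
# is never present, so the tracked usage can only reflect what the API sent.
chunks = [{'usage_metadata': {'prompt_token_count': 21, 'total_token_count': 21}}] * 5
assert usage_from_stream(chunks) == {
    'prompt_token_count': 21,
    'candidates_token_count': 0,  # never reported by this model version
    'total_token_count': 21,
}
```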

@almeidaalajoel (Author)

https://ai.google.dev/api/generate-content#UsageMetadata

Indeed, the candidates_token_count already sums across all candidates, so summing it again here does not make sense.

@DouweM (Contributor) commented Apr 30, 2025

@almeidaalajoel Thank you, makes sense if this is documented by Google. Can you see if you can rebase on main and update the failing tests?

@DouweM marked this pull request as draft April 30, 2025 19:31