Exponential delay on retry should be optional/removed #1360

akmaldju · 2025-02-07T15:28:12Z

Which version of the app are you using?

v3.3.14

Which API Provider are you using?

Google Gemini

Which Model are you using?

gemini-2.0-flash

What happened?

I'm not sure what the reason was for introducing exponentialDelay for API request retries, but it gets out of control for Gemini models. The Gemini API has two cases in which a request fails: either the user exceeded their per-minute quota, or an uncontrollable shared quota kicked in. The shared quota is applied seemingly at random to everyone when the API is "busy" and is lifted again a few seconds later. When that happens, the delay grows from 5 to 40-80 seconds, and the app ends up waiting for over a minute without even trying to call the API. I don't think any API requires a cooldown period like this anyway. This feature should be either optional or removed imho, or at least be capped at something like 30-45 seconds before the next retry.

Steps to reproduce

1. Use free-tier Gemini models from Google AI Studio.
2. Try to develop features for a while until it happens.

Relevant API REQUEST output

Additional context

No response

mrubens (Maintainer) · 2025-02-07T16:14:06Z

Yeah it makes sense to give users more control over the exponent and the cap, thank you for flagging this.
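As a rough illustration of what "control over the exponent and the cap" could look like, here is a minimal sketch. The option names (`baseDelaySeconds`, `multiplier`, `maxDelaySeconds`) and the function `retryDelaySeconds` are hypothetical and not taken from the Roo Code source; `retryAttempt` is assumed to start at 0.

```typescript
// Hypothetical settings; none of these names come from the Roo Code codebase.
interface BackoffOptions {
  baseDelaySeconds: number; // delay before the first retry
  multiplier: number;       // growth factor per attempt (1 disables growth)
  maxDelaySeconds: number;  // upper bound on any single delay
}

// Delay before retry number `retryAttempt` (assumed to start at 0).
function retryDelaySeconds(retryAttempt: number, opts: BackoffOptions): number {
  const uncapped = opts.baseDelaySeconds * Math.pow(opts.multiplier, retryAttempt);
  return Math.min(opts.maxDelaySeconds, Math.ceil(uncapped));
}

const exampleOptions: BackoffOptions = {
  baseDelaySeconds: 5,
  multiplier: 2,
  maxDelaySeconds: 45,
};

// Delays for attempts 0..5: [5, 10, 20, 40, 45, 45]
const exampleDelays = [0, 1, 2, 3, 4, 5].map((n) => retryDelaySeconds(n, exampleOptions));
```

Setting `multiplier` to 1 would effectively make the exponential growth optional, since every retry then waits `baseDelaySeconds`.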

bramburn · 2025-02-09T23:21:36Z

interesting, is anyone else working on this bug?

LousyBook94 · 2025-02-10T15:17:15Z

Yeah, I would love this. Sometimes I have to wait 200 sec or so after multiple failures.

elroy-bot · 2025-02-24T22:07:13Z

@mrubens what about just changing this line https://github.com/RooVetGit/Roo-Code/blob/main/src/core/Cline.ts#L1017

from:
const exponentialDelay = Math.ceil(baseDelay * Math.pow(2, retryAttempt))

which (with default requestDelaySeconds of 5) yields retries at:

to:
const exponentialDelay = Math.max(baseDelay, Math.pow(2, retryAttempt))
which would yield:

A few retries at 5s doesn't seem too bad, and if the user sets a more aggressive requestDelaySeconds we still get a pretty reasonable set of attempts.
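To compare the two formulas side by side, a small sketch can tabulate the delay for the first few attempts. It assumes `retryAttempt` starts at 0 and `baseDelay` is the default 5 seconds; the function names `currentDelay` and `proposedDelay` are made up for this illustration.

```typescript
// Formula as quoted from Cline.ts: doubles the base delay on every attempt.
function currentDelay(baseDelay: number, retryAttempt: number): number {
  return Math.ceil(baseDelay * Math.pow(2, retryAttempt));
}

// Proposed formula: use the base delay until 2^retryAttempt exceeds it.
function proposedDelay(baseDelay: number, retryAttempt: number): number {
  return Math.max(baseDelay, Math.pow(2, retryAttempt));
}

// Assumption: attempts are numbered from 0.
const attemptNumbers = [0, 1, 2, 3, 4, 5, 6];

// [5, 10, 20, 40, 80, 160, 320]
const currentSchedule = attemptNumbers.map((n) => currentDelay(5, n));

// [5, 5, 5, 8, 16, 32, 64]
const proposedSchedule = attemptNumbers.map((n) => proposedDelay(5, n));
```

Under these assumptions the proposed formula grows more slowly at first, but it is still unbounded: it keeps doubling once 2^retryAttempt passes the base delay.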

elroy-bot · 2025-02-24T23:15:54Z

especially with the behavior being seen with the Gemini free model, it feels likely that this issue would bite new users more often, so a tweak in default behavior might be preferable to adding more params

LousyBook94 · 2025-03-01T05:53:54Z

what about clamping the max retry delay to maybe something like 30?

1 reply

nonsleepr Jun 25, 2025

Yes, I'm being throttled by my provider for the past two days and the backoff grows to 1200s.
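For a sense of scale, the sketch below compares total waiting time with and without a 30-second clamp. It assumes the doubling formula quoted earlier in the thread, a base delay of 5 seconds, and attempts numbered from 0; `delaySchedule` and `totalWaitSeconds` are hypothetical helpers, not Roo Code functions.

```typescript
// Builds the list of delays for `attemptCount` retries, optionally clamped.
function delaySchedule(baseDelay: number, attemptCount: number, capSeconds?: number): number[] {
  const schedule: number[] = [];
  for (let retryAttempt = 0; retryAttempt < attemptCount; retryAttempt++) {
    const exponential = Math.ceil(baseDelay * Math.pow(2, retryAttempt));
    schedule.push(capSeconds === undefined ? exponential : Math.min(exponential, capSeconds));
  }
  return schedule;
}

// Sums the delays to get the total time spent waiting.
function totalWaitSeconds(schedule: number[]): number {
  return schedule.reduce((sum, delay) => sum + delay, 0);
}

// Nine retries without a cap: the last delay alone is 1280 seconds.
const uncappedSchedule = delaySchedule(5, 9);
const uncappedTotal = totalWaitSeconds(uncappedSchedule); // 2555

// Same nine retries clamped at 30 seconds: [5, 10, 20, 30, 30, 30, 30, 30, 30]
const cappedSchedule = delaySchedule(5, 9, 30);
const cappedTotal = totalWaitSeconds(cappedSchedule); // 215
```

With these numbers, nine uncapped retries spend over 42 minutes waiting, while the clamped schedule spends under 4 minutes.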
