Handle 413 "prompt too large" responses from APIs like Vertex gracefully #2277
ianlintner started this conversation in Feature Requests · Replies: 1 comment
+1 It's not just Vertex AI (Google's models have a 1M-token window, one of the largest). It's worse on Anthropic's models, which have a 200K-token window: trying to read one nontrivial npm lock file will blow that away every time. I have specifically changed my prompts for one project to exclude lockfiles for this reason.
As a user, if Roo detects a "prompt too large" response (e.g. an HTTP 413 from Google Vertex AI), it could handle it automatically without user input rather than getting stuck in a retry loop.
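A minimal sketch of the requested behavior: classify a "prompt too large" failure as non-retryable and route it to a recovery path (shrink the context, or surface it to the user) instead of retrying. All names here (`handle_api_error`, `is_prompt_too_large`, the status sets) are illustrative assumptions, not Roo's actual error-handling API.

```python
# Hypothetical sketch: a prompt-size error is permanent for a given request,
# so retrying the same payload can never succeed. Treat it separately from
# transient errors (rate limits, server hiccups) that are worth retrying.

RETRYABLE_STATUSES = {429, 500, 503}   # transient: retry with backoff
PROMPT_TOO_LARGE = 413                 # permanent: retrying cannot help


def is_prompt_too_large(status: int, message: str) -> bool:
    """Detect a prompt-size failure from the status code or error text.

    Some providers signal this via HTTP 413; others return a 4xx with a
    message about exceeding the maximum context/token length.
    """
    return status == PROMPT_TOO_LARGE or "exceeds the maximum" in message.lower()


def handle_api_error(status: int, message: str) -> str:
    """Decide what the client should do with a failed model request."""
    if is_prompt_too_large(status, message):
        # Do NOT retry: shrink the prompt (drop lockfiles, truncate
        # context) or ask the user, instead of looping forever.
        return "shrink_prompt"
    if status in RETRYABLE_STATUSES:
        return "retry"
    return "fail"
```

With this split, the retry loop only ever sees transient errors; a 413 (or an "exceeds the maximum context length" message) short-circuits straight to prompt-reduction logic.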