Input truncation is automatic on Ollama; you need to add a single flag to fix it #1403
-
Just a quick update: on a hunch I tried the "OpenAI compatible" provider, but it's the same story...
-
Can you please point me to a specific model so I can download it and try to repro it? Also, can you provide a screenshot of your model config settings and a screenshot of the chat session where it fails? EDIT: Sorry, I had a swear word in there I did not mean to include. I should read what I type! Sorry @devlux76
-
I know what you mean @devlux76, and I think it would help a lot with the Ollama integration, since you wouldn't have to manually create a new model every time (i.e. remove step 3 from https://docs.roocode.com/providers/ollama#setting-up-ollama). Let me convert this to a feature request and we can keep discussing.
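For reference, the manual step being removed looks roughly like this (a sketch based on Ollama's Modelfile docs; the model names are placeholders):

```
# Modelfile: bake a larger context window into a derived model
FROM llama3.1
PARAMETER num_ctx 32768
```

followed by `ollama create llama3.1-32k -f Modelfile`. This is exactly the per-model boilerplate the feature request would make unnecessary, since the client could set `num_ctx` per request instead.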
-
Which version of the app are you using?
latest head
Which API Provider are you using?
Ollama
Which Model are you using?
All of them
What happened?
Thanks for an excellent product. I love that it integrates with so many providers, but the Ollama integration is broken due to an incredibly short-sighted design decision by the Ollama team that I don't think you're aware of.
Ollama by default truncates input to 2048 tokens, which is really quite tiny.
To override this, all you need to do is set the context limit (`num_ctx`) in the request options to something reasonable; there is no reason not to use the model's full context length for this.
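A minimal sketch of what that looks like against Ollama's chat endpoint (this is not Roo Code's actual implementation; the model name is a placeholder, and the port is Ollama's default). The `options.num_ctx` field itself is documented in the Ollama FAQ linked below:

```typescript
// Raise the context window per request via options.num_ctx,
// overriding Ollama's 2048-token default for this call only.
async function chatWithLargerContext(prompt: string): Promise<string> {
  const res = await fetch("http://localhost:11434/api/chat", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({
      model: "llama3.1", // placeholder model name
      messages: [{ role: "user", content: prompt }],
      stream: false, // return a single JSON object instead of a stream
      options: { num_ctx: 8192 }, // override the 2048-token default
    }),
  });
  const data = await res.json();
  return data.message.content;
}
```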
This can of course cause resource issues, so best practice is to measure what you currently have, add the size of the expected generation, and cap the total at the model's max context size. That way you don't waste a bunch of resources when you only need a much smaller context at the moment.
Source: https://github.com/ollama/ollama/blob/main/docs/faq.md
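That sizing rule is simple enough to sketch (a hypothetical helper, not anything from Roo Code or Ollama; the token counts are assumed to come from whatever tokenizer your setup provides):

```typescript
// Ask for just enough room for the prompt plus the expected reply,
// but never more than the model actually supports.
function pickNumCtx(
  promptTokens: number,
  expectedGenerationTokens: number,
  modelMaxContext: number,
): number {
  return Math.min(promptTokens + expectedGenerationTokens, modelMaxContext);
}

// Example: a 3000-token prompt expecting ~1000 tokens back on a 32k model
// yields num_ctx = 4000 rather than a wasteful 32768.
```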
Steps to reproduce
Relevant API REQUEST output
Additional context
No response