check FIM response for expected format #73

pnb · 2025-07-02T20:32:16Z

Partially fixes #61 by handling errors gracefully (though it will not help users discover that their entire server is set up incorrectly). ~~Also does not explain why llama-server sometimes has sequence 0 does not start from the last position stored in the memory errors.~~ But, it does handle unexpected server responses gracefully.

This approach handles unexpected issues upon server response in fim_on_response, avoiding taking up slots in the cache with invalid responses. You can test it with a couple of examples by entering these commands:

Not valid JSON (endpoint returns HTML)

:let g:llama_config['endpoint'] = 'https://example.com'

Valid JSON missing the `content` key

:let g:llama_config['endpoint'] = 'https://dummyjson.com/posts/add'

It might be overkill to check the JSON string before decoding, I was just worried about small performance hits from unnecessarily doing full JSON decodes, especially since this can happen on every keypress if the responses are invalid, since there will never be cache hits.

ggerganov · 2025-07-03T07:06:50Z

Also does not explain why llama-server sometimes has sequence 0 does not start from the last position stored in the memory errors.

Which version of llama-server and model are you using when you get this error?

ggerganov · 2025-07-03T07:11:46Z

Also does not explain why llama-server sometimes has sequence 0 does not start from the last position stored in the memory errors.

Which version of llama-server and model are you using when you get this error?

FYI, these 2 changes from last week should have fixed the problem:

If you still spot the error with a build that includes these fixes, let me know.

pnb · 2025-07-03T14:20:08Z

Now that you mention it, I haven't seen that "sequence 0" error yet this week. I update the server almost daily, so that probably explains it. I was using Qwen 2.5 14B coder, by the way.

However, this PR wasn't designed to fix only that issue specifically, but more generally handle the case of unexpected server responses. I still get those almost daily when running over a network, as my home wifi is OK but not amazing. So it would still be useful to merge this, I think.

check FIM response for expected format

ff760a8

pnb mentioned this pull request Jul 2, 2025

Key not present in Dictionary: "content" #61

Closed

ggerganov approved these changes Jul 3, 2025

View reviewed changes

ggerganov merged commit f886bad into ggml-org:master Jul 3, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

check FIM response for expected format #73

check FIM response for expected format #73

Uh oh!

pnb commented Jul 2, 2025 •

edited

Loading

Uh oh!

ggerganov commented Jul 3, 2025

Uh oh!

ggerganov commented Jul 3, 2025 •

edited

Loading

Uh oh!

pnb commented Jul 3, 2025

Uh oh!

Uh oh!

check FIM response for expected format #73

check FIM response for expected format #73

Uh oh!

Conversation

pnb commented Jul 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Not valid JSON (endpoint returns HTML)

Valid JSON missing the content key

Uh oh!

ggerganov commented Jul 3, 2025

Uh oh!

ggerganov commented Jul 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pnb commented Jul 3, 2025

Uh oh!

Uh oh!

pnb commented Jul 2, 2025 •

edited

Loading

Valid JSON missing the `content` key

ggerganov commented Jul 3, 2025 •

edited

Loading