Replies: 2 comments 3 replies
-
Likely a misconfiguration issue. I'm using Qwen 2.5 32B daily without any issues. |
Beta Was this translation helpful? Give feedback.
3 replies
-
Is your binary up to date, few weeks ago crash was linked to kv cache quantization + defrag |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I've try it with Qwen 2.5 32B/72B, perfect for single turn Q&A, but in multi-turn conversation, it got some bugs, like core dumped, or after normal output for a while, it start output random strings, some times it end, sometimes not.
For "random strings output bug", I can make a repeatable case, just wonder is it worth?
I'm use Open Web UI as frontend, and the bug won't occurs when switch to aphrodite-engine.
I've try K/V cache with default(f16) q8_0 q4_0, all of them had "random strings output bug", the later two had "core dumped bug"
Beta Was this translation helpful? Give feedback.
All reactions