Replies: 1 comment
It turns out the main culprit was the 'enforce_isolation' option on kernel 6.12. Turning it off eliminates most of the lag, although I still get some stuttering during VAE processing and at the very beginning of prompt processing.
On my aging 3400G, the whole desktop GUI (Linux, either X11+XFCE or Wayland+KDE) tends to freeze completely during the more intensive GPU computations of llama.cpp/stable-diffusion.cpp (on Vulkan). From low to high impact:
Also, these 'choking' events sometimes trigger driver bugs, causing full system lock-ups.
So, I'm looking for ways to throttle GPU usage during inference. What I've tried so far:
`ctx->device->device.waitIdle()` + sleep before `ggml_vk_build_graph` calls: kind of works as a proof of concept, but of course is no real solution.

Thoughts?
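For reference, the wait-then-sleep idea above can be sketched as a small duty-cycle loop. This is only an illustration under stated assumptions, not code from ggml or llama.cpp: `run_throttled`, `ThrottleStats`, `submit_chunk`, `wait_idle`, `idle_ratio` and `max_sleep` are all hypothetical names. The pause after each chunk is proportional to how long that chunk kept the GPU busy, capped at `max_sleep`.

```cpp
#include <chrono>
#include <functional>
#include <thread>

// Hypothetical helper, not part of ggml or llama.cpp.
struct ThrottleStats {
    std::chrono::nanoseconds busy{0};   // measured time spent in submit + wait
    std::chrono::nanoseconds slept{0};  // total pause requested (not measured)
};

// Runs `num_chunks` pieces of GPU work. After each piece it waits for the
// device to go idle, then sleeps for `idle_ratio` times the measured busy
// time (capped at `max_sleep`) so other GPU clients, such as the desktop
// compositor, get a chance to run.
inline ThrottleStats run_throttled(int num_chunks,
                                   const std::function<void(int)>& submit_chunk,
                                   const std::function<void()>& wait_idle,
                                   double idle_ratio,
                                   std::chrono::nanoseconds max_sleep) {
    using clock = std::chrono::steady_clock;
    ThrottleStats stats;
    for (int i = 0; i < num_chunks; ++i) {
        const auto start = clock::now();
        submit_chunk(i);  // assumption: records and submits one slice of the graph
        wait_idle();      // assumption: blocks until the GPU has finished that slice
        const auto busy = std::chrono::duration_cast<std::chrono::nanoseconds>(
            clock::now() - start);

        // Pause proportional to the busy time, never longer than max_sleep.
        auto pause = std::chrono::nanoseconds(
            static_cast<long long>(busy.count() * idle_ratio));
        if (pause > max_sleep) {
            pause = max_sleep;
        }
        if (pause.count() > 0) {
            std::this_thread::sleep_for(pause);
        }

        stats.busy += busy;
        stats.slept += pause;
    }
    return stats;
}
```

In the Vulkan backend, `wait_idle` would presumably wrap `ctx->device->device.waitIdle()` and `submit_chunk` the work done around `ggml_vk_build_graph`. Like the original experiment, this trades throughput for responsiveness and does not change how the driver schedules work, so it remains a workaround rather than a real fix.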