OpenCL issues - RTX 4070 x64 #3077
Comments
More info: ggml-org/llama.cpp#10693
Thanks - what about Kompute? Any limitations there?
The kompute backend has not been maintained for a very long time, so it's not relevant atm.
It would be highly beneficial if there was a public table of platform status. By platform I mean: ggml_backend_load_all in ggml-backend-reg.cpp tries this...
Plus an optional ggml_backend_load(backend_path);
Here's what I currently know:
I've got blas, cuda, vulkan and rpc ATM on Windows (+ todo on Linux).
Not touched cann (Huawei Mobile?), hip, sycl or musa (looks old) yet - any of these good (and on what platforms?)
Raspberry Pi = cpu only (in tests).
Planning AMD + CUDA tests on cheaper AWS Linux instances (once Windows/Mac via Pascal all good).
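As a rough illustration of what such a status table could look like, here is a minimal Python sketch. The `STATUS` dictionary and the `render_table` helper are hypothetical names made up for this example, and the cell values only restate the notes in the comment above; they are not an official ggml support matrix.

```python
# Hypothetical sketch: the values restate one commenter's notes from this
# thread and are NOT an official ggml/whisper.cpp support matrix.
STATUS = {
    "blas":   {"Windows": "built",    "Linux": "todo"},
    "cuda":   {"Windows": "built",    "Linux": "todo"},
    "vulkan": {"Windows": "built",    "Linux": "todo"},
    "rpc":    {"Windows": "built",    "Linux": "todo"},
    "cann":   {"Windows": "untested", "Linux": "untested"},
    "hip":    {"Windows": "untested", "Linux": "untested"},
    "sycl":   {"Windows": "untested", "Linux": "untested"},
    "musa":   {"Windows": "untested", "Linux": "untested"},
}


def render_table(status, platforms):
    """Render a backend-by-platform status mapping as aligned plain text."""
    header = ["backend"] + list(platforms)
    # Unknown backend/platform combinations are shown as "?".
    rows = [[name] + [cells.get(p, "?") for p in platforms]
            for name, cells in status.items()]
    table = [header] + rows
    # Column width = longest cell in that column.
    widths = [max(len(row[i]) for row in table) for i in range(len(header))]
    return "\n".join(
        "  ".join(cell.ljust(w) for cell, w in zip(row, widths)).rstrip()
        for row in table
    )


if __name__ == "__main__":
    print(render_table(STATUS, ["Windows", "Linux"]))
```

Keeping the data in one mapping means a new platform column (for example a Raspberry Pi entry) only needs an extra key per backend.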
With whisper-bench I'm getting...
Compiled with MSVC Win11 ...
Is OpenCL only for phones or something? I've run some OpenCL tests in the past, so I know OpenCL works; it's just GGML that has problems with it.
OK, CUDA is going to be faster - I just want to run comparisons. I've also got Vulkan support compiled in and that's only slightly slower than CUDA
The Adreno flag sort of hints at a possible issue, I guess, but I can't find anything saying it WON'T work.
Possibly vcpkg's OpenCL is an issue? [its OpenVINO was an issue]