Replies: 1 comment
FYI: https://siboehm.com/articles/22/CUDA-MMM
Is it possible to compile a llama binary that doesn't require `cublas64_12.dll` and `cublasLt64_12.dll` at runtime? `cudart64_12.dll` is tiny, but cublas is around half a gig! I don't want to ship it with my app, nor do I want to make users install the CUDA toolkit (cublas is not included with the usual Nvidia driver install).

I tried setting `-DGGML_CUDA_FORCE_MMQ=ON`, but it still crashes because it can't find `cublas64_12.dll` at runtime.