-
I think that's an ARM CPU thing.
-
See https://github.com/ggerganov/llama.cpp/pull/4966/files. It should work on certain ARM platforms with at least Q4_0 and Q8_0, as shown in the discussions.
-
I see references to GGUF_TYPE_INT8 in ggml.c. Is there a particular GGUF size that is guaranteed to use it, or does the file have to be generated a certain way?