```
(base) server@Server:~/llama.cpp$ ./quantize ./models/Phi-3-mini-4k-instruct/ggml-model-f16.gguf ./models/Phi-3-mini-4k-instruct/ggml-model-Q4_K_M.gguf Q4_K_M
main: build = 913 (eb542d3)
main: quantizing './models/Phi-3-mini-4k-instruct/ggml-model-f16.gguf' to './models/Phi-3-mini-4k-instruct/ggml-model--Q4_K_M.gguf' as Q4_K_M
llama.cpp: loading model from ./models/Phi-3-mini-4k-instruct/ggml-model-f16.gguf
llama_model_quantize: failed to quantize: unknown (magic, version) combination: 46554747, 00000003; is this really a GGML file?
main: failed to quantize model from './models/Phi-3-mini-4k-instruct/ggml-model-f16.gguf'
```
I converted Phi-3 to GGUF successfully, but when I tried to quantize the resulting GGUF file to 4 bits, I hit the error above. Does anyone have a suggestion for resolving this quantization issue? Thank you in advance.
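From what I can tell, the magic `46554747` is just the four bytes "GGUF" read as a little-endian 32-bit integer (and `00000003` would be GGUF v3), while the "is this really a GGML file?" wording comes from the old pre-GGUF loader, so my `quantize` from build 913 is probably simply too old to read GGUF files at all. Below is a minimal sketch of what I plan to try, assuming the usual Makefile build; the binary name is a guess, since newer builds ship the tool as `llama-quantize` rather than `quantize`:

```bash
# Update llama.cpp to a GGUF-aware (and Phi-3-aware) build, then rebuild.
cd ~/llama.cpp
git pull
make clean && make

# Re-run the quantization with the rebuilt tool. On current builds the
# binary is ./llama-quantize; on older GGUF-era builds it is ./quantize.
./llama-quantize ./models/Phi-3-mini-4k-instruct/ggml-model-f16.gguf \
                 ./models/Phi-3-mini-4k-instruct/ggml-model-Q4_K_M.gguf Q4_K_M
```

Since Phi-3 support only landed in llama.cpp well after the switch to GGUF, the f16 GGUF was presumably produced by a much newer conversion script than the `quantize` binary was built from, which would explain the format mismatch.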