Quantizing/converting Llama 3.1 on latest master results in an unloadable model. Anything I'm missing? #9077
-
Looks like it should have been solved by #8676 but hasn't? I built llama.cpp locally on macOS, downloaded the models from HuggingFace (recently), and ran the conversion and quantization steps. The end result is the same as the one mentioned here: #8650 (comment). I'm on commit
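For context, the usual flow looks roughly like this; the model path and the Q4_K_M quant type below are placeholders, not my exact invocation:

```sh
# Convert the HuggingFace checkpoint to GGUF (path is a placeholder)
python convert_hf_to_gguf.py ./Meta-Llama-3.1-8B-Instruct \
    --outfile llama-3.1-8b-instruct-f16.gguf --outtype f16

# Quantize the F16 GGUF (Q4_K_M chosen purely as an example)
./llama-quantize llama-3.1-8b-instruct-f16.gguf \
    llama-3.1-8b-instruct-q4_k_m.gguf Q4_K_M

# Try to load the quantized model
./llama-cli -m llama-3.1-8b-instruct-q4_k_m.gguf -p "Hello"
```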
Replies: 1 comment
-
Updating for whoever needs it: the `llama-cli` and other binaries in the root folder of the project are not symlinks to the recently built ones, so the `llama-cli` there will never match the one you just built after running the build commands. Just build it yourself and run it from the build folder, and it should work with the quantized models.
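In other words, something like this (a sketch assuming a CMake build; adjust the model path to your own file):

```sh
# Build fresh binaries; they end up under build/bin/, not in the repo root
cmake -B build
cmake --build build --config Release

# Run the freshly built llama-cli from the build folder, not the stale copy in the root
./build/bin/llama-cli -m llama-3.1-8b-instruct-q4_k_m.gguf -p "Hello"
```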