Use llama.cpp quantization with QAT finetuning #11170
anyinlover asked this question in Q&A (unanswered)
I'm finetuning a 1B model with QAT using the torchtune library, but I'm not sure whether the quantization scheme it trains against matches the llama.cpp quantization algorithm. Does anyone have experience with this?
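To make the comparison concrete, below is a minimal NumPy sketch of how llama.cpp's Q4_0 format quantizes one block of 32 weights (a single signed scale per block, symmetric, no zero point), based on my reading of the ggml reference implementation; the function name and the use of NumPy are just for illustration, and the real format stores the scale as fp16. This is the scheme a QAT fake-quant setup would need to mirror if Q4_0 is the target; K-quants such as Q4_K use a different super-block layout and would need a separate comparison.

```python
import numpy as np

QK4_0 = 32  # llama.cpp Q4_0 block size


def fake_quantize_q4_0_block(x: np.ndarray) -> np.ndarray:
    """Quantize-then-dequantize one block of 32 weights, Q4_0 style.

    Q4_0 is symmetric: one scale per 32-weight block, 4-bit codes in
    [0, 15] with an implicit offset of 8. Returning the dequantized
    block lets it be compared against the fake-quant output of a QAT
    quantizer.
    """
    assert x.shape == (QK4_0,)
    # The scale comes from the value with the largest magnitude (sign kept),
    # so that value maps to code 0 (i.e. -8 after removing the offset).
    max_val = x[np.argmax(np.abs(x))]
    d = max_val / -8.0
    inv_d = 0.0 if d == 0 else 1.0 / d
    # Quantize to 4-bit codes and clamp to the representable range.
    q = np.clip(np.floor(x * inv_d + 8.5), 0, 15).astype(np.int8)
    # Dequantize back to float for comparison.
    return (q.astype(np.float32) - 8) * d
```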