Does imatrix work independent of the dtype used for calibration? #7538

bartowski1182 · 2024-05-25T17:51:07Z

bartowski1182
May 25, 2024

IE, if I convert to F32 and BF16, calculate the imatrix from F32 (since it supports GPU offloading), but then apply it to the BF16 weights during quantization, will that work as expected or will the change in dtype affect it?

@ikawrakow sorry for the ping but after searching couldn't find any way to reach out to you directly and I imagine you're the most well versed on this subject..

Similarly, if F16 for imatrix and BF16 for quantization, curious how this would work

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Does imatrix work independent of the dtype used for calibration? #7538

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Does imatrix work independent of the dtype used for calibration? #7538

Uh oh!

bartowski1182 May 25, 2024

Replies: 0 comments

bartowski1182
May 25, 2024