How to use 7-bit integer quantization? #5526
Unanswered
GermanAizek
asked this question in
Q&A
Replies: 1 comment
-
If you use the available Q6_K type quantization you'll end up with around 6.6 bpw. It shows performance equal to fp8 and very similar to fp16. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
ٴ
Beta Was this translation helpful? Give feedback.
All reactions