WARNING 04-15 15:50:49 config.py:211] awq quantization is not fully optimized yet. The speed can be slower than non-quantized models.? #4101
silvacarl2
announced in
Q&A
Replies: 0 comments
Is this anything we need to be concerned about during inference?
WARNING 04-15 15:50:49 config.py:211] awq quantization is not fully optimized yet. The speed can be slower than non-quantized models.
I ask because inference already seems REALLY REALLY FAST.
8-)
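One way to find out whether the warning matters for a given workload is to time generation with the AWQ model and with the non-quantized model and compare tokens per second. Below is a minimal sketch of such a measurement, not an official vLLM benchmark. `measure_throughput` and `fake_generate` are hypothetical names made up for illustration, and the sketch assumes you wrap your own engine in a function that takes a list of prompts and returns one list of generated token IDs per prompt.

```python
import time
from typing import Callable, Sequence


def measure_throughput(
    generate_fn: Callable[[Sequence[str]], Sequence[Sequence[int]]],
    prompts: Sequence[str],
    warmup_runs: int = 1,
) -> float:
    """Return generated tokens per second for one timed batch.

    generate_fn is a hypothetical wrapper around your inference engine:
    it takes prompts and returns one list of token IDs per prompt.
    """
    # Warm-up runs keep one-time setup costs out of the timed run.
    for _ in range(warmup_runs):
        generate_fn(prompts)

    start = time.perf_counter()
    outputs = generate_fn(prompts)
    elapsed = time.perf_counter() - start

    total_tokens = sum(len(token_ids) for token_ids in outputs)
    return total_tokens / elapsed if elapsed > 0 else float("inf")


def fake_generate(prompts: Sequence[str]) -> list:
    """Stand-in generator so the sketch runs without a GPU or a model."""
    time.sleep(0.01)  # pretend to do some work
    return [[0] * 16 for _ in prompts]


if __name__ == "__main__":
    rate = measure_throughput(fake_generate, ["hello", "world"])
    print(f"{rate:.1f} tokens/s")
```

Running the same prompts and sampling settings through a wrapper for the AWQ model and a wrapper for the unquantized model gives two numbers to compare. If the AWQ number is acceptable for your use case, the warning is only informational.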