WARNING 04-15 15:50:49 config.py:211] awq quantization is not fully optimized yet. The speed can be slower than non-quantized models.? #4101

silvacarl2 · 2024-04-15T22:52:12Z

Is this anything we need to be concerned about during inference?

WARNING 04-15 15:50:49 config.py:211] awq quantization is not fully optimized yet. The speed can be slower than non-quantized models.

I ask because it already seems REALLY REALLY FAST.

8-)