Skip to content

Commit 37bd8d6

Browse files
authored
[Bug] DeepGemm: Fix TypeError: per_block_cast_to_fp8() missing 1 required positional argument: 'use_ue8m0' for SM100 (#21187)
Signed-off-by: yewentao256 <zhyanwentao@126.com>
1 parent 468e240 commit 37bd8d6

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm/utils/deep_gemm.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -99,7 +99,7 @@ def fp8_m_grouped_gemm_nt_masked(*args, **kwargs):
9999

100100
def per_block_cast_to_fp8(x, *args, **kwargs):
101101
if _per_block_cast_impl is not None and is_blackwell_deep_gemm_used():
102-
return _per_block_cast_impl(x)
102+
return _per_block_cast_impl(x, use_ue8m0=True)
103103
# TODO: refactor the `per_block_cast_to_fp8` from tests to vllm utils
104104
from tests.kernels.quant_utils import per_block_cast_to_fp8 as _pbcf
105105
return _pbcf(x, *args, **kwargs)

0 commit comments

Comments
 (0)