Skip to content

Commit b76ba2e

Browse files
committed
[Bugfix] Fix topk_ids indices_type for cutlass w8a8 fp8 moe
Signed-off-by: Ming Yang <yming@meta.com>
1 parent ead3698 commit b76ba2e

File tree

1 file changed

+1
-2
lines changed

1 file changed

+1
-2
lines changed

vllm/model_executor/layers/quantization/compressed_tensors/compressed_tensors_moe.py

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -605,8 +605,7 @@ def apply(
605605
num_expert_group=num_expert_group,
606606
custom_routing_function=custom_routing_function,
607607
scoring_func=scoring_func,
608-
e_score_correction_bias=e_score_correction_bias,
609-
indices_type=torch.uint32)
608+
e_score_correction_bias=e_score_correction_bias)
610609

611610
return self.fused_experts(
612611
x,

0 commit comments

Comments
 (0)