Commit 1567180

[Bugfix] Fix topk_ids indices_type for cutlass w8a8 fp8 moe
Signed-off-by: Ming Yang <yming@meta.com>
1 parent cd4cfee commit 1567180

1 file changed: +1 -2 lines changed
vllm/model_executor/layers/quantization/compressed_tensors/compressed_tensors_moe.py

Lines changed: 1 addition & 2 deletions
@@ -621,8 +621,7 @@ def apply(
             num_expert_group=num_expert_group,
             custom_routing_function=custom_routing_function,
             scoring_func=scoring_func,
-            e_score_correction_bias=e_score_correction_bias,
-            indices_type=torch.uint32)
+            e_score_correction_bias=e_score_correction_bias)
 
         return self.fused_experts(
             x,
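
After this change the call no longer passes `indices_type`, so the index type falls back to whatever default the callee defines. The following is a minimal sketch of that mechanism only. It uses a hypothetical pure-Python stand-in named `select_experts`, not vLLM's implementation, and `str` merely stands in for a non-default index type such as `torch.uint32`.

```python
from typing import Callable, List, Optional, Tuple


def select_experts(
    scores: List[float],
    top_k: int,
    indices_type: Optional[Callable[[int], object]] = None,
) -> Tuple[List[float], List[object]]:
    """Hypothetical stand-in: return the top_k scores and their indices."""
    # Order expert indices by descending score and keep the first top_k.
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:top_k]
    weights = [scores[i] for i in order]
    # Without indices_type the indices stay plain ints (the callee's default);
    # with it, every index is converted to the requested type.
    ids = order if indices_type is None else [indices_type(i) for i in order]
    return weights, ids


scores = [0.1, 0.7, 0.2]

# Before the fix: the caller forces a specific index type.
_, ids_before = select_experts(scores, top_k=2, indices_type=str)

# After the fix: the keyword is omitted, so the default index type is used.
_, ids_after = select_experts(scores, top_k=2)
```

In this sketch, `ids_before` holds converted indices while `ids_after` holds the default integer indices. The commit itself does not state which index type the cutlass w8a8 fp8 MoE path expects, so the sketch illustrates only the fallback to the default.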