1 parent ead3698 commit b76ba2e
vllm/model_executor/layers/quantization/compressed_tensors/compressed_tensors_moe.py
@@ -605,8 +605,7 @@ def apply(
 num_expert_group=num_expert_group,
 custom_routing_function=custom_routing_function,
 scoring_func=scoring_func,
-e_score_correction_bias=e_score_correction_bias,
-indices_type=torch.uint32)
+e_score_correction_bias=e_score_correction_bias)
 
 return self.fused_experts(
 x,