Commit d24758e

harygo22 authored and weijinqian_v1 committed
fix a bug
Signed-off-by: weijinqian_v1 <weijinqian@huawei.com>
1 parent 66807e0 commit d24758e

1 file changed (+1, -1 lines)

vllm_ascend/ascend_forward_context.py

Lines changed: 1 addition & 1 deletion
@@ -22,7 +22,7 @@ def get_fused_moe_state(ep_size: int, with_prefill: bool):
     if ep_size == 1:
         return FusedMoEState.AllGather
     elif envs_ascend.VLLM_ASCEND_ENABLE_MOE_ALL2ALL_SEQ:
-        return FusedMoEState.All2AllSeq
+        return FusedMoEState.All2AllSeq if ep_size < 16 else FusedMoEState.MC2
     # NOTE: mc2 need ep_size >= 16 & all2all can't use in torchair graph.
     elif ep_size < 16 or with_prefill:
         return FusedMoEState.All2All
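
The effect of the one-line change can be checked with a small standalone sketch of the branches visible in this hunk. This is not the actual module: the `FusedMoEState` enum below is a hypothetical stand-in containing only the members named in the diff, the `all2all_seq_enabled` parameter is an assumed replacement for reading `envs_ascend.VLLM_ASCEND_ENABLE_MOE_ALL2ALL_SEQ`, and whatever the real function does after the last visible branch is not shown in the diff, so the sketch raises instead of guessing.

```python
from enum import Enum


class FusedMoEState(Enum):
    # Hypothetical stand-in: only the members that appear in the diff.
    AllGather = 0
    All2All = 1
    All2AllSeq = 2
    MC2 = 3


def get_fused_moe_state(ep_size: int, with_prefill: bool,
                        all2all_seq_enabled: bool) -> FusedMoEState:
    """Mirror the post-commit branches shown in the hunk.

    all2all_seq_enabled is an assumed parameter standing in for
    envs_ascend.VLLM_ASCEND_ENABLE_MOE_ALL2ALL_SEQ.
    """
    if ep_size == 1:
        return FusedMoEState.AllGather
    elif all2all_seq_enabled:
        # After the fix: with the flag set, ep_size >= 16 selects MC2
        # instead of always returning All2AllSeq.
        return FusedMoEState.All2AllSeq if ep_size < 16 else FusedMoEState.MC2
    elif ep_size < 16 or with_prefill:
        return FusedMoEState.All2All
    # The remainder of the real function lies outside the hunk.
    raise NotImplementedError("branch not shown in the diff")


if __name__ == "__main__":
    for ep_size in (1, 8, 16):
        print(ep_size, get_fused_moe_state(ep_size, False, True).name)
```

In this sketch, the only inputs whose result differs from the pre-commit behavior are those with the flag enabled and `ep_size >= 16`. Note that `with_prefill` is not consulted on that branch, so MC2 is selected there regardless of prefill.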
