Skip to content

Commit 6f6efc1

Browse files
author
weijinqian_v1
committed
[Feature]Moe alltoallv communication optimization for unquantized RL training sence & alltoallv support dpo
Signed-off-by: weijinqian_v1 <weijinqian@huawei.com>
1 parent 807aaf0 commit 6f6efc1

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm_ascend/ops/fused_moe.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -566,7 +566,7 @@ def fused_experts_with_all2allv(token_dispatcher, probs, routing_map, hidden_sta
566566
hidden_states, probs, routing_map
567567
)
568568

569-
expert_output = apply_mlp(hidden_states,
569+
expert_output = apply_mlp(dispatched_input,
570570
w1,
571571
w2,
572572
tokens_per_expert)

0 commit comments

Comments
 (0)