Skip to content

Commit cf3f1c8

Browse files
author
weijinqian_v1
committed
[Feature]Moe alltoallv communication optimization for unquantized RL training sence & alltoallv support dpo
Signed-off-by: weijinqian_v1 <weijinqian@huawei.com>
1 parent 47439e8 commit cf3f1c8

File tree

1 file changed

+1
-3
lines changed

1 file changed

+1
-3
lines changed

vllm_ascend/ops/fused_moe.py

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -565,10 +565,8 @@ def fused_experts_with_all2allv(token_dispatcher, probs, routing_map, hidden_sta
565565
(share_experts_output, dispatched_input, tokens_per_expert) = token_dispatcher.token_permutation(
566566
hidden_states, probs, routing_map
567567
)
568-
hidden_states_wrapper = [dispatched_input]
569-
del dispatched_input
570568

571-
expert_output = apply_mlp(hidden_states_wrapper,
569+
expert_output = apply_mlp(hidden_states,
572570
w1,
573571
w2,
574572
tokens_per_expert)

0 commit comments

Comments
 (0)