Commit 66807e0

harygo22 authored and weijinqian_v1 committed
fix bug
Signed-off-by: weijinqian_v1 <weijinqian@huawei.com>
1 parent b02ad40 commit 66807e0

1 file changed: +1 -3 lines changed


vllm_ascend/models/moe_block.py

Lines changed: 1 addition & 3 deletions
@@ -98,10 +98,8 @@ def forward(
             is_prefill = True
             enable_force_load_balance = True
         else:
-            is_prefill = False
+            is_prefill = get_forward_context().with_prefill
             enable_force_load_balance = False
-            if hasattr(attn_metadata, 'with_prefill_across_dp'):
-                is_prefill = attn_metadata.with_prefill_across_dp
 
         # router_logits: (num_tokens, n_experts)
         router_logits, _ = self.gate(hidden_states)
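
The behavioral difference in the else branch can be illustrated with a minimal sketch. This is not the project's code: `is_prefill_before` and `is_prefill_after` are hypothetical helper names, and `SimpleNamespace` objects stand in for the real `attn_metadata` and for whatever `get_forward_context()` returns. The sketch only assumes what the diff shows, namely that the old code read an optional `with_prefill_across_dp` attribute and the new code reads `with_prefill` from the forward context.

```python
from types import SimpleNamespace


def is_prefill_before(attn_metadata):
    """Else-branch logic removed by this commit (hypothetical wrapper)."""
    is_prefill = False
    # The flag was only picked up if the metadata object carried it.
    if hasattr(attn_metadata, 'with_prefill_across_dp'):
        is_prefill = attn_metadata.with_prefill_across_dp
    return is_prefill


def is_prefill_after(forward_context):
    """Else-branch logic added by this commit (hypothetical wrapper).

    forward_context is a stand-in for the object returned by
    get_forward_context() in the real code.
    """
    return forward_context.with_prefill


# Assumed scenario: the metadata object lacks the attribute,
# while the forward context reports that prefill is in progress.
attn_metadata = SimpleNamespace()
forward_context = SimpleNamespace(with_prefill=True)

before = is_prefill_before(attn_metadata)
after = is_prefill_after(forward_context)
```

In this assumed scenario the old logic silently falls back to `False`, while the new logic takes the value from the forward context. Whether that scenario is the bug the commit message refers to is not stated in the commit.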
