Commit aa20d10

[Misc] [ROCm] Prevent surplus tensor reshape (#19803)
Signed-off-by: Zsolt Borbely <zsolt.borbely@htecgroup.com>
1 parent: 2de12be

1 file changed, +1 -1 lines changed

vllm/v1/attention/backends/triton_attn.py

Lines changed: 1 addition & 1 deletion
@@ -376,7 +376,7 @@ def forward(
                     query.reshape(
                         (num_tokens, num_heads * head_size)).contiguous(),
                     layer._q_scale)
-            query = query.reshape((num_tokens, num_heads, head_size))
+                query = query.reshape((num_tokens, num_heads, head_size))

         use_local_attn = \
             (self.use_irope and attn_metadata.local_attn_metadata is not None)
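
Since the removed and added lines carry the same text, the change is one of indentation. A plausible reading, given the commit title, is that the reshape back to three dimensions now runs only in the branch that flattened `query` for quantization. The sketch below illustrates that pattern under stated assumptions: `FakeTensor`, `fake_quantize`, and `prepare_query` are hypothetical stand-ins written for this example, not vLLM or PyTorch APIs, and the `quantize` flag stands in for whatever condition guards the real branch.

```python
from dataclasses import dataclass


@dataclass
class FakeTensor:
    """Hypothetical stand-in for a tensor: tracks only a shape and a reshape count."""
    shape: tuple
    reshape_calls: int = 0

    def reshape(self, shape):
        # Return a new object so the call count travels with the result.
        return FakeTensor(tuple(shape), self.reshape_calls + 1)


def fake_quantize(t):
    """Hypothetical placeholder for an op that only accepts 2-D input."""
    assert len(t.shape) == 2
    return t


def prepare_query(query, quantize):
    num_tokens, num_heads, head_size = query.shape
    if quantize:
        # Flatten for the 2-D op, then restore the 3-D layout in the same branch.
        query = fake_quantize(
            query.reshape((num_tokens, num_heads * head_size)))
        query = query.reshape((num_tokens, num_heads, head_size))
    # When quantize is False the query was never flattened, so no reshape runs.
    return query


skipped = prepare_query(FakeTensor((4, 8, 64)), quantize=False)
quantized = prepare_query(FakeTensor((4, 8, 64)), quantize=True)
print(skipped.reshape_calls, quantized.reshape_calls)  # prints "0 2"
```

With the restoring reshape placed one level out, the skipped path would issue one reshape that leaves the shape unchanged; placing it inside the branch removes that call.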
