Update hpu_model_runner.py #1374

michalkuligowski · 2025-06-05T10:54:13Z

No description provided.

michalkuligowski · 2025-06-11T13:48:22Z

/run-gaudi-tests

vllm/worker/hpu_model_runner.py

madamczyk-intel · 2025-06-12T10:24:51Z

Why do we need this change anyway? As of now, the bucketing context constructors appear to generate prompt buckets automatically: https://github.com/HabanaAI/vllm-hpu-extension/blob/main/vllm_hpu_extension/bucketing/linear.py#L52
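For readers unfamiliar with the term, here is a minimal sketch of what "generating prompt buckets" can look like under a linear scheme. It is not taken from the linked `linear.py`: the function names `warmup_range` and `generate_prompt_buckets`, the `(min, step, max)` tuples, and the token-budget filter are all assumptions made for illustration only.

```python
from itertools import product


def warmup_range(bucket_min: int, bucket_step: int, bucket_max: int) -> list[int]:
    """Hypothetical helper: double from bucket_min until bucket_step is
    reached, then advance linearly in multiples of bucket_step."""
    if bucket_min < 1 or bucket_step < 1:
        raise ValueError("bucket_min and bucket_step must be >= 1")
    values = []
    value = bucket_min
    # Ramp-up phase: bucket_min times powers of two, strictly below bucket_step.
    while value < bucket_step and value <= bucket_max:
        values.append(value)
        value *= 2
    # Linear phase: multiples of bucket_step up to and including bucket_max.
    value = bucket_step
    while value <= bucket_max:
        values.append(value)
        value += bucket_step
    return values


def generate_prompt_buckets(bs_cfg, seq_cfg, max_num_batched_tokens):
    """Hypothetical helper: cross batch sizes with sequence lengths and drop
    pairs whose total token count exceeds the (assumed) token budget."""
    return [
        (bs, seq)
        for bs, seq in product(warmup_range(*bs_cfg), warmup_range(*seq_cfg))
        if bs * seq <= max_num_batched_tokens
    ]


# Illustrative values only; not the project's defaults.
buckets = generate_prompt_buckets((1, 4, 8), (128, 128, 512), 1024)
```

The point of the sketch is only that the bucket list can be derived from a few range parameters when the bucketing context is constructed, which is the behaviour the question above refers to.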

michalkuligowski · 2025-06-12T11:05:25Z

This comes from a fix that was introduced some time ago (April) and was removed by mistake during one of the rebases to upstream.
/run-gaudi-tests

michalkuligowski · 2025-06-18T07:23:02Z

/skip-gaudi-tests


michalkuligowski added 4 commits June 5, 2025 12:53

Update hpu_model_runner.py

1c010e2

Update hpu_model_runner.py

e7c69da

Merge branch 'habana_main' into michalkuligowski-patch-exp-llama-degr

cecc15f

Update hpu_model_runner.py

27040cb

michalkuligowski marked this pull request as ready for review June 11, 2025 13:32

michalkuligowski requested review from kzawora-intel, madamczyk-intel, mgawarkiewicz-intel, vivekgoe, afierka-intel, xuechendi, jikunshang and mswiniarsk as code owners June 11, 2025 13:32

michalkuligowski added 2 commits June 11, 2025 15:47

Update hpu_model_runner.py

4bafd31

Merge branch 'habana_main' into michalkuligowski-patch-exp-llama-degr

a43339a

madamczyk-intel previously requested changes Jun 12, 2025

vllm/worker/hpu_model_runner.py (resolved)

michalkuligowski added 2 commits June 12, 2025 13:04

Update hpu.txt

15d5c89

Merge branch 'habana_main' into michalkuligowski-patch-exp-llama-degr

4b52b19

michalkuligowski mentioned this pull request Jun 16, 2025

Update linear.py HabanaAI/vllm-hpu-extension#211

Merged

mgawarkiewicz-intel approved these changes Jun 18, 2025


Update hpu.txt

3edb3e1

michalkuligowski merged commit 7a2b6d0 into habana_main Jun 18, 2025
6 checks passed

michalkuligowski deleted the michalkuligowski-patch-exp-llama-degr branch June 18, 2025 07:26

michalkuligowski mentioned this pull request Jun 18, 2025

Generate buckets for prompt for V1 #1451

Merged

michalkuligowski added a commit that referenced this pull request Jun 23, 2025

Generate buckets for prompt for V1 (#1451)

70fd38e

Follow-up for #1374
