
Fixing (1) tensor size mismatch and (2) missing prepare_cos_sin issues for Phi-3.5 #916


Merged
merged 5 commits into from
Apr 1, 2025

Conversation

mrezavand
Collaborator

@mrezavand mrezavand commented Mar 14, 2025

Solving the two issues reported in: https://jira.habana-labs.com/browse/SW-221986

  1. vLLM inferencing of the "microsoft/Phi-3.5-mini-instruct" model, both offline and online, first fails with a missing 'prepare_cos_sin' method, which is required for the rotary position embeddings (RoPE):
ERROR 03-11 10:04:32 engine.py:389]   File "/home/vllm-fork/vllm/worker/hpu_model_runner.py", line 424, in _prepare_cos_sin
ERROR 03-11 10:04:32 engine.py:389]     raise AttributeError(
ERROR 03-11 10:04:32 engine.py:389] AttributeError: The module at the end of the path does not have a 'prepare_cos_sin' method.
  2. The model then fails due to a tensor size mismatch:
File "/home/vllm-fork/vllm/model_executor/layers/rotary_embedding.py", line 641, in forward
    query_rot = query_rot * cos + _rotate_neox(query_rot) * sin
RuntimeError: The size of tensor a (96) must match the size of tensor b (1024) at non-singleton dimension 3
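For context, the kind of work a `prepare_cos_sin`-style hook does can be sketched as below. This is an illustrative pure-Python sketch, not the vLLM/HPU implementation: the function name, signature, and list-based layout are assumptions made for readability. It shows why the cached cos/sin rows must span the full rotary dimension of the query slice they multiply; a row of the wrong width is exactly the sort of shape disagreement that surfaces as the broadcast error above.

```python
# Hypothetical sketch of RoPE cos/sin precomputation. Names and the
# pure-Python layout are illustrative assumptions, not vLLM's API.
import math

def prepare_cos_sin(positions, rotary_dim, base=10000.0):
    """Precompute per-position cos/sin tables for rotary embeddings."""
    # One inverse frequency per rotated dimension pair.
    inv_freq = [base ** (-2.0 * i / rotary_dim)
                for i in range(rotary_dim // 2)]
    cos_table, sin_table = [], []
    for pos in positions:
        angles = [pos * f for f in inv_freq]
        # NeoX-style layout repeats the half-table so each cached row
        # spans the full rotary_dim of the query/key slice it scales.
        cos_table.append([math.cos(a) for a in angles] * 2)
        sin_table.append([math.sin(a) for a in angles] * 2)
    return cos_table, sin_table

cos, sin = prepare_cos_sin(positions=[0, 1, 2], rotary_dim=8)
# Each row has rotary_dim entries, so it lines up elementwise with the
# rotated query slice: len(cos[0]) == 8.
```

If the cached rows were built for a different dimension than the query slice being rotated (e.g. a sequence-length-sized table multiplied against a head-dimension-sized slice), the elementwise products above would fail with precisely a "size of tensor a must match size of tensor b" error.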

@michalkuligowski

/run-gaudi-tests

@mrezavand
Collaborator Author

@michalkuligowski I just fixed the ruff issue.

@michalkuligowski

/run-gaudi-tests

@michalkuligowski

/run-gaudi-tests

@michalkuligowski

/run-gaudi-tests

@michalkuligowski michalkuligowski merged commit c795cc5 into HabanaAI:habana_main Apr 1, 2025
37 checks passed

2 participants