Skip to content

Commit e00e5ea

Browse files
authored
Update __init__.py
1 parent 3797db8 commit e00e5ea

File tree

1 file changed

+15
-1
lines changed

1 file changed

+15
-1
lines changed

vllm_ascend/patch/__init__.py

Lines changed: 15 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -158,4 +158,18 @@
158158
# - https://github.com/vllm-project/vllm-ascend/pull/395
159159
# Future Plan:
160160
# Revert it when the related pr is merged in vllm and vllm-ascend.
161-
#
161+
#
162+
# ** File: worker/patch_common/patch_eagle.py **
163+
# ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
164+
# 1. `vllm.v1.spec_decode.eagle.prepare_inputs`
165+
# Why:
166+
# We need to use the patched `prepare_input_kernel` in `eagle.prepare_inputs`.
167+
# The mainly reason to overwrite `prepare_input_kernel` is this is a triton
168+
# kernel, ascend is now not support triton kernel.
169+
# How:
170+
# Re-implementation the `prepare_input_kernel` triton kernel by pytorch
171+
# Related PR (if no, explain why): 1. refused by vllm. 2. vllm doesn't support 3. prepare to submit....
172+
# - https://github.com/vllm-project/vllm-ascend/pull/874
173+
# Future Plan:
174+
# Revert it when the ascend support triton kernel.
175+
#

0 commit comments

Comments
 (0)