File tree Expand file tree Collapse file tree 1 file changed +15
-1
lines changed Expand file tree Collapse file tree 1 file changed +15
-1
lines changed Original file line number Diff line number Diff line change 158
158
# - https://github.com/vllm-project/vllm-ascend/pull/395
159
159
# Future Plan:
160
160
# Revert it when the related pr is merged in vllm and vllm-ascend.
161
- #
161
+ #
162
+ # ** File: worker/patch_common/patch_eagle.py **
163
+ # ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
164
+ # 1. `vllm.v1.spec_decode.eagle.prepare_inputs`
165
+ # Why:
166
+ # We need to use the patched `prepare_input_kernel` in `eagle.prepare_inputs`.
167
+ # The mainly reason to overwrite `prepare_input_kernel` is this is a triton
168
+ # kernel, ascend is now not support triton kernel.
169
+ # How:
170
+ # Re-implementation the `prepare_input_kernel` triton kernel by pytorch
171
+ # Related PR (if no, explain why): 1. refused by vllm. 2. vllm doesn't support 3. prepare to submit....
172
+ # - https://github.com/vllm-project/vllm-ascend/pull/874
173
+ # Future Plan:
174
+ # Revert it when the ascend support triton kernel.
175
+ #
You can’t perform that action at this time.
0 commit comments