Commit 14b914e

[V1] Support ngram spec decode
Signed-off-by: ponix-j <657511300@qq.com>
1 parent 0f53b13 commit 14b914e

12 files changed, +1553 -12 lines changed

.github/workflows/vllm_ascend_test.yaml

Lines changed: 2 additions & 1 deletion
@@ -161,8 +161,9 @@ jobs:
         if: steps.filter_spec_decode.outputs.speculative_tests_changed == 'true' || github.event_name == 'schedule'
         run: |
           if [[ "${{ matrix.os }}" == "linux-arm64-npu-1" ]]; then
+            VLLM_USE_MODELSCOPE=true pytest -sv tests/singlecard/spec_decode/e2e/test_v1_spec_decode.py
             pytest -sv tests/singlecard/spec_decode/e2e/test_mtp_correctness.py # it needs a clean process
-            pytest -sv tests/singlecard/spec_decode --ignore=tests/singlecard/spec_decode/e2e/test_mtp_correctness.py
+            pytest -sv tests/singlecard/spec_decode --ignore=tests/singlecard/spec_decode/e2e/test_mtp_correctness.py --ignore=tests/singlecard/spec_decode/e2e/test_v1_spec_decode.py
           fi
 
       - name: Run vllm-project/vllm test for V0 Engine

requirements-dev.txt

Lines changed: 1 addition & 0 deletions
@@ -9,3 +9,4 @@ ray
 types-jsonschema
 xgrammar
 zmq
+numba

tests/sample/__init__.py

Whitespace-only changes.
