Skip to content

Commit 8b48daa

Browse files
authored
[CI] rename Qwen2.5-0.5B-Instruct-W8A8 model (#1145)
1. rename vllm-ascend/Qwen2.5-0.5B-Instruct-W8A8-new to vllm-ascend/Qwen2.5-0.5B-Instruct-W8A8 Signed-off-by: 22dimensions <waitingwind@foxmail.com>
1 parent 8dd686d commit 8b48daa

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

tests/singlecard/test_offline_inference.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -39,7 +39,7 @@
3939
MULTIMODALITY_MODELS = ["Qwen/Qwen2.5-VL-3B-Instruct"]
4040

4141
QUANTIZATION_MODELS = [
42-
"vllm-ascend/Qwen2.5-0.5B-Instruct-W8A8-new",
42+
"vllm-ascend/Qwen2.5-0.5B-Instruct-W8A8",
4343
]
4444
os.environ["PYTORCH_NPU_ALLOC_CONF"] = "max_split_size_mb:256"
4545

0 commit comments

Comments
 (0)