Commit e343e13

22dimensions authored and wangxiaoxin (A) committed
[CI] rename Qwen2.5-0.5B-Instruct-W8A8 model (#1145)
1. rename vllm-ascend/Qwen2.5-0.5B-Instruct-W8A8-new to vllm-ascend/Qwen2.5-0.5B-Instruct-W8A8

Signed-off-by: 22dimensions <waitingwind@foxmail.com>
Signed-off-by: wangxiaoxin (A) <wangxiaoxin7@huawei.com>
1 parent 192cbbc commit e343e13

1 file changed: +1 -1 lines changed


tests/singlecard/test_offline_inference.py

Lines changed: 1 addition & 1 deletion
@@ -39,7 +39,7 @@
 MULTIMODALITY_MODELS = ["Qwen/Qwen2.5-VL-3B-Instruct"]
 
 QUANTIZATION_MODELS = [
-    "vllm-ascend/Qwen2.5-0.5B-Instruct-W8A8-new",
+    "vllm-ascend/Qwen2.5-0.5B-Instruct-W8A8",
 ]
 os.environ["PYTORCH_NPU_ALLOC_CONF"] = "max_split_size_mb:256"
 