Skip to content

Commit 866ce71

Browse files
authored
[Benchmark] Download model from modelscope (#634)
### What this PR does / why we need it? - Run benchmark scripts will Download model from modelscope Signed-off-by: wangli <wangli858794774@gmail.com>
1 parent 05bdcbe commit 866ce71

File tree

4 files changed

+8
-4
lines changed

4 files changed

+8
-4
lines changed

benchmarks/scripts/run-performance-benchmarks.sh

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -264,6 +264,10 @@ main() {
264264
# turn of the reporting of the status of each request, to clean up the terminal output
265265
export VLLM_LOG_LEVEL="WARNING"
266266

267+
# set env
268+
export VLLM_USE_MODELSCOPE="True"
269+
export HF_ENDPOINT="https://hf-mirror.com"
270+
267271
# prepare for benchmarking
268272
cd benchmarks || exit 1
269273
get_benchmarks_scripts

benchmarks/tests/latency-tests.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@
22
{
33
"test_name": "latency_llama8B_tp1",
44
"parameters": {
5-
"model": "meta-llama/Llama-3.1-8B-Instruct",
5+
"model": "LLM-Research/Meta-Llama-3.1-8B-Instruct",
66
"tensor_parallel_size": 1,
77
"load_format": "dummy",
88
"num_iters_warmup": 5,

benchmarks/tests/serving-tests.json

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -8,15 +8,15 @@
88
"inf"
99
],
1010
"server_parameters": {
11-
"model": "meta-llama/Llama-3.1-8B-Instruct",
11+
"model": "LLM-Research/Meta-Llama-3.1-8B-Instruct",
1212
"tensor_parallel_size": 1,
1313
"swap_space": 16,
1414
"disable_log_stats": "",
1515
"disable_log_requests": "",
1616
"load_format": "dummy"
1717
},
1818
"client_parameters": {
19-
"model": "meta-llama/Llama-3.1-8B-Instruct",
19+
"model": "LLM-Research/Meta-Llama-3.1-8B-Instruct",
2020
"backend": "vllm",
2121
"dataset_name": "sharegpt",
2222
"dataset_path": "./ShareGPT_V3_unfiltered_cleaned_split.json",

benchmarks/tests/throughput-tests.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@
22
{
33
"test_name": "throughput_llama8B_tp1",
44
"parameters": {
5-
"model": "meta-llama/Llama-3.1-8B-Instruct",
5+
"model": "LLM-Research/Meta-Llama-3.1-8B-Instruct",
66
"tensor_parallel_size": 1,
77
"load_format": "dummy",
88
"dataset_path": "./ShareGPT_V3_unfiltered_cleaned_split.json",

0 commit comments

Comments
 (0)