Commit cfe6ba5

Llama-3.1-8B-Instruct add accuracy/server-rocm.
1 parent 73e07f2 commit cfe6ba5

1 file changed, 4 additions and 0 deletions
@@ -0,0 +1,4 @@
+trust-remote-code: true
+tensor-parallel-size: 1
+max-model-len: 16384
+gpu_memory_utilization: 0.8
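
The added file is a flat list of `key: value` options, and the last key uses underscores while the other three use hyphens. The commit does not say which tool reads the file or how it treats the two spellings. As a rough sketch only, assuming the loader maps each key to a `--key value` command-line flag and treats `true` as a bare switch, the expansion could look like this. The helper `config_to_flags` is hypothetical and is not taken from any library.

```python
def config_to_flags(text: str) -> list[str]:
    """Turn flat `key: value` lines into `--key value` command-line tokens.

    Hypothetical helper for illustration; handles only simple one-level
    `key: value` lines, not general YAML.
    """
    flags: list[str] = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        key, _, value = line.partition(":")
        # Assumption: underscores and hyphens name the same option,
        # so normalize to the hyphenated flag style.
        name = "--" + key.strip().replace("_", "-")
        value = value.strip()
        if value.lower() == "true":
            flags.append(name)  # boolean switch: emit the flag alone
        elif value.lower() == "false":
            continue  # disabled switch: emit nothing
        else:
            flags.extend([name, value])
    return flags


# The four lines added by this commit.
CONFIG = """\
trust-remote-code: true
tensor-parallel-size: 1
max-model-len: 16384
gpu_memory_utilization: 0.8
"""

flags = config_to_flags(CONFIG)
print(" ".join(flags))
# prints: --trust-remote-code --tensor-parallel-size 1 --max-model-len 16384 --gpu-memory-utilization 0.8
```

If the real loader does not normalize underscores, `gpu_memory_utilization` would need to be written as `gpu-memory-utilization` to match the other keys.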
