We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 73e07f2 commit cfe6ba5Copy full SHA for cfe6ba5
meta-llama/Llama-3.1-8B-Instruct/accuracy/server-rocm.yml
@@ -0,0 +1,4 @@
1
+trust-remote-code: true
2
+tensor-parallel-size: 1
3
+max-model-len: 16384
4
+gpu_memory_utilization: 0.8
0 commit comments