1 parent cfe6ba5 commit 049aaf1
meta-llama/Llama-3.1-8B-Instruct/accuracy/server-rocm.yml
@@ -1,4 +1,4 @@
 trust-remote-code: true
 tensor-parallel-size: 1
 max-model-len: 16384
-gpu_memory_utilization: 0.8
+gpu_memory_utilization: 0.6