Commit 049aaf1

meta-llama/Llama-3.1-8B-Instruct: decrease gpu_memory_utilization to prevent OOM (ROCm)
1 parent cfe6ba5 commit 049aaf1

1 file changed: 1 addition, 1 deletion

@@ -1,4 +1,4 @@
 trust-remote-code: true
 tensor-parallel-size: 1
 max-model-len: 16384
-gpu_memory_utilization: 0.8
+gpu_memory_utilization: 0.6
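
A rough sketch of what the change means in absolute terms, assuming these keys are vLLM engine options, where `gpu_memory_utilization` is the fraction of device memory the engine may claim. The 64 GiB device size and the helper name `memory_budget_gib` are hypothetical and not taken from the commit; substitute the memory of the actual ROCm device.

```python
def memory_budget_gib(total_gib: float, utilization: float) -> float:
    """Return the share of device memory the engine may claim, in GiB."""
    if not 0.0 < utilization <= 1.0:
        raise ValueError("utilization must be in (0, 1]")
    return total_gib * utilization


# Hypothetical device size; the commit does not state which GPU is used.
TOTAL_GIB = 64.0

before = memory_budget_gib(TOTAL_GIB, 0.8)  # value removed by the commit
after = memory_budget_gib(TOTAL_GIB, 0.6)   # value added by the commit

print(f"before: {before:.1f} GiB, after: {after:.1f} GiB, "
      f"left for other users of the device: {before - after:.1f} GiB more")
```

Lowering the fraction leaves more headroom for anything else that allocates on the same device, at the cost of a smaller budget for the KV cache, so the achievable batch size at `max-model-len: 16384` may shrink.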
