The current default for the number of GPU layers to offload is 99 (e.g. in `llama-bench`). Some models already have around half that many layers. A value of 99 also suggests to some readers that this parameter might be a percentage or something similar.

Being set to 99 confused me for a bit. Since the value is stored as an `int32_t`, I would suggest setting the default to `std::numeric_limits<int32_t>::max()`. This would avoid the confusion now, and would prevent incorrect benchmark results once models grow to around 99 layers or more in the future.
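For illustration, a minimal sketch of what the proposed default could look like, assuming a parameter struct roughly like the ones llama.cpp uses (the `bench_params` struct and field placement here are hypothetical; only the `std::numeric_limits<int32_t>::max()` default is the actual suggestion):

```cpp
#include <cstdint>
#include <iostream>
#include <limits>

// Hypothetical parameter struct for illustration only; the real field lives in
// llama.cpp's own parameter structs, which are not reproduced here.
struct bench_params {
    // Old default: a "magic" 99 that reads like a percentage and will be
    // overtaken once models reach ~99 layers:
    //   int32_t n_gpu_layers = 99;

    // Proposed default: an unmistakable "offload all layers" sentinel.
    // Values larger than the model's layer count already mean "offload
    // everything", so INT32_MAX keeps that behaviour for any future model size.
    int32_t n_gpu_layers = std::numeric_limits<int32_t>::max();
};

int main() {
    bench_params p;
    std::cout << "default n_gpu_layers = " << p.n_gpu_layers << "\n";
    return 0;
}
```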