If the entire model plus the context doesn't fit in VRAM, could we have a feature such as -ngl -1 that automatically calculates the maximum number of layers that fit on the GPU and offloads the rest to the CPU?
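A rough sketch of the idea, purely illustrative and not based on llama.cpp internals: all names (estimate_gpu_layers, the byte-size parameters) are hypothetical, and the per-layer size and overhead would in practice have to be derived from the model's tensor metadata and the requested context length.

```cpp
#include <cstdint>
#include <algorithm>
#include <cstdio>

// Hypothetical inputs: free VRAM reported by the driver, the estimated
// size of one transformer layer's weights, and a reserved overhead for
// the KV cache / compute buffers that must also live on the GPU.
int estimate_gpu_layers(int64_t free_vram_bytes,
                        int64_t bytes_per_layer,
                        int64_t overhead_bytes,
                        int total_layers) {
    const int64_t usable = free_vram_bytes - overhead_bytes;
    if (usable <= 0 || bytes_per_layer <= 0) {
        return 0; // nothing fits; keep everything on the CPU
    }
    const int fit = static_cast<int>(usable / bytes_per_layer);
    return std::min(fit, total_layers); // never offload more layers than exist
}

int main() {
    // Example numbers (assumed): 8 GiB free, ~450 MiB per layer,
    // 1.5 GiB reserved, 32-layer model.
    const int ngl = estimate_gpu_layers(8LL << 30, 450LL << 20,
                                        3LL << 29, 32);
    std::printf("offload %d layers to the GPU\n", ngl);
    return 0;
}
```

The remaining total_layers - ngl layers (and anything else that doesn't fit) would then stay on the CPU, exactly as an explicit, smaller -ngl value does today.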