Skip to content

why does ./main use vram even if i did -ngl 0 -ngld 0? #3587

Closed Answered by KerfuffleV2
hiqsociety asked this question in Q&A
Discussion options

You must be logged in to vote

Prompt processing can use VRAM even when not offloading layers. As far as I know, there isn't a way to disable that, so basically the solution would be to compile an executable that only has CPU support and use that in the case you want to be use it won't touch the GPU.

Just note that ingesting the prompt/running stuff like perplexity is much, much slower on CPU.

Replies: 1 comment 4 replies

Comment options

You must be logged in to vote
4 replies
@hiqsociety
Comment options

@KerfuffleV2
Comment options

@hiqsociety
Comment options

@hiqsociety
Comment options

Answer selected by hiqsociety
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants