Replies: 2 comments 3 replies
-
Run "main --help" to see parameter usage explained.
-
You can find the documentation for the main binary here: https://github.com/ggerganov/llama.cpp/blob/master/examples/main/README.md
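For the GPU question in this thread: as far as I know, the number of offloaded layers is set with `-ngl` / `--n-gpu-layers`, and it only has an effect when the binary was built with GPU support (for example a cuBLAS or CLBlast build). Below is a minimal sketch of a Python helper that assembles the command line from the original post and appends that flag. The helper name `build_main_args` and the layer count of 32 are just illustrative assumptions; pick a value that fits your model and VRAM, and check `main --help` to confirm the flag exists in your build.

```python
import subprocess


def build_main_args(model_path, n_gpu_layers=0, binary="main.exe"):
    """Assemble an argument list for llama.cpp's main binary.

    The base flags are copied from the command quoted in this thread.
    `n_gpu_layers` is passed as -ngl, assuming a GPU-enabled build.
    """
    args = [
        binary,
        "-i", "--interactive-first",
        "-r", "### Human:",
        "--temp", "0",
        "-c", "2048",
        "-n", "-1",
        "--ignore-eos",
        "--repeat_penalty", "1.2",
        "--instruct",
        "-m", model_path,
    ]
    if n_gpu_layers > 0:
        # Hypothetical value chosen by the caller; 0 keeps everything on the CPU.
        args += ["-ngl", str(n_gpu_layers)]
    return args


if __name__ == "__main__":
    # 32 is an illustrative layer count, not a recommendation.
    args = build_main_args("whateverModelFileName", n_gpu_layers=32)
    # Print the command in a form that can be pasted into a Windows prompt.
    print(subprocess.list2cmdline(args))
```

If the log still reports 0 offloaded layers after adding the flag, the binary may be a CPU-only build, in which case a different release archive would be needed.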
-
Maybe I'm dumb and missed it, but there seems to be effectively zero documentation on using the Windows binaries. I was able to piece together enough to get it running, but there are still some things I don't get. For example, it would have been nice to have a description of which binary to use and what each version means.
And I still haven't found a list of the parameters for launching main.exe. I found another post where someone used the following:
main.exe -i --interactive-first -r "### Human:" --temp 0 -c 2048 -n -1 --ignore-eos --repeat_penalty 1.2 --instruct -m whateverModelFileName
This works, but I want to be able to use my GPU, yet the output says
offloading 0 layers to GPU
total VRAM used: 0 MB
and I can't find anything explaining how to change that.