My output is too low #4164
paulocoutinhox
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi,
Im trying implement simple.cpp into my app, but the output is less than 20 characters.
Im using the mistral 7b model.
Im checking the simple.cpp file and see this var:
https://github.com/ggerganov/llama.cpp/blob/master/examples/simple/simple.cpp#L30
What it represent?
How can i get the biggest number of tokens removing my own prompt and using this number as tokens to be returned, based on simple.cpp?
Thanks.
Beta Was this translation helpful? Give feedback.
All reactions