Max model output tokens? #108
Answered
by
rodrigo-f-nogueira
deniseiras
asked this question in
Q&A
-
I came across a 512 token cut in the response of the sabe-3 model. |
Beta Was this translation helpful? Give feedback.
Answered by
rodrigo-f-nogueira
Aug 23, 2024
Replies: 1 comment 4 replies
-
Hi Denis, In the web chat, the maximum number of generated tokens is 4000. In the API, you can configure it by setting the max_tokens parameters: import maritalk
model = maritalk.MariTalk(
key="insira sua chave aqui. Ex: '100088...'",
model="sabia-3"
)
response = model.generate(
"Quanto é 25 + 27?",
max_tokens=200)
answer = response["answer"] |
Beta Was this translation helpful? Give feedback.
4 replies
Answer selected by
deniseiras
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Hi Denis,
In the web chat, the maximum number of generated tokens is 4000.
In the API, you can configure it by setting the max_tokens parameters: