Max model output tokens? #108

deniseiras · 2024-08-23T19:42:40Z

I ran into a 512-token cutoff in the response from the sabia-3 model.
Where can I check this information for all models?
How can I change the call to return more tokens?
Cheers!

rodrigo-f-nogueira · 2024-08-23T20:12:53Z

rodrigo-f-nogueira
Aug 23, 2024
Maintainer

Hi Denis,

In the web chat, the maximum number of generated tokens is 4000.

In the API, you can configure it by setting the max_tokens parameter:

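import maritalk

# Create a client for the sabia-3 model.
model = maritalk.MariTalk(
    key="insert your key here, e.g. '100088...'",
    model="sabia-3"
)

# max_tokens caps how many tokens are generated for this request.
response = model.generate(
    "Quanto é 25 + 27?",  # Portuguese for "How much is 25 + 27?"
    max_tokens=200
)

answer = response["answer"]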

4 replies

deniseiras Aug 26, 2024
Author

Hi,
thanks for your answer, it works.
What is the maximum value for max_tokens? At https://www.maritaca.ai/ I can see "200 mil tokens de saída por minuto" (200 thousand output tokens per minute). So is it possible to get a completion with 200k tokens in a single call, or are there limits related to the message size in the HTTP call?
Cheers

rodrigo-f-nogueira Aug 26, 2024
Maintainer

Hi @deniseiras,

200k tokens/min is the rate limit, i.e., if you send requests such that more than 200k tokens are generated in one minute, the server will refuse to generate more tokens. Then you have to wait one minute or so to send new requests.
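To make the rate limit concrete, here is a minimal client-side sketch that tracks generated tokens in a sliding window and reports how long to wait before the next request would fit. It is only an illustration: `TokenRateBudget` is a hypothetical helper (not part of the maritalk library), the 60-second sliding window is an assumption, and the server's own accounting may differ.

```python
from collections import deque
import time


class TokenRateBudget:
    """Hypothetical client-side tracker for tokens generated per time window."""

    def __init__(self, limit=200_000, window_seconds=60.0, clock=time.monotonic):
        self.limit = limit                    # tokens allowed per window (200k in this thread)
        self.window_seconds = window_seconds  # assumed window length
        self.clock = clock                    # injectable so the logic can be tested
        self._events = deque()                # (timestamp, tokens) pairs, oldest first

    def _drop_expired(self, now):
        # Forget usage that is older than the window.
        while self._events and now - self._events[0][0] >= self.window_seconds:
            self._events.popleft()

    def used(self):
        """Tokens recorded inside the current window."""
        self._drop_expired(self.clock())
        return sum(tokens for _, tokens in self._events)

    def record(self, tokens):
        """Call after a request, with the number of tokens it generated."""
        self._events.append((self.clock(), tokens))

    def seconds_until_available(self, tokens):
        """Seconds to wait until `tokens` more would fit under the limit."""
        if tokens > self.limit:
            raise ValueError("request is larger than the per-window limit")
        now = self.clock()
        self._drop_expired(now)
        used = sum(spent for _, spent in self._events)
        wait = 0.0
        for timestamp, spent in self._events:
            if used + tokens <= self.limit:
                break
            # This entry has to expire before the request fits.
            used -= spent
            wait = timestamp + self.window_seconds - now
        return max(wait, 0.0)
```

A caller could sleep for `seconds_until_available(max_tokens)` before each request and call `record(...)` afterwards with the number of tokens actually generated.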

The max_tokens parameter is the maximum number of tokens generated in a single request. For instance, if you are using Sabia-3, which supports 32000 context tokens, and your prompt (input) has 1000 tokens, max_tokens can be set to at most 32000 - 1000 = 31000 tokens.
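The arithmetic above can be written as a small helper. This is just a sketch: `max_completion_tokens` is a hypothetical function, the 32000 figure is the one quoted in this reply, and the prompt's token count is assumed to be known already (counting it requires the model's tokenizer, which is not shown here).

```python
SABIA_3_CONTEXT_TOKENS = 32_000  # context size quoted in this thread


def max_completion_tokens(prompt_tokens, context_tokens=SABIA_3_CONTEXT_TOKENS):
    """Largest max_tokens value that still fits in the context window."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = context_tokens - prompt_tokens
    if remaining <= 0:
        raise ValueError("the prompt already fills the context window")
    return remaining


print(max_completion_tokens(1000))  # 31000
```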

deniseiras Aug 26, 2024
Author

Thanks Rodrigo, I understand it all now!

rodrigo-f-nogueira Aug 26, 2024
Maintainer

Great, please let us know if you have any further questions.
