How to set minimum number of output tokens ? #2451
thangld201
announced in
Q&A
Replies: 1 comment
-
You can checkout this PR here #1945 but it has been stalled. If you can help getting this into a good shape that would be even better! |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
This param is equivalent to
min_new_tokens
in huggingface.generate()
. Is this already supported in vLLM ? @WoosukKwon I only see themax_tokens
option inSamplingParam
..Beta Was this translation helpful? Give feedback.
All reactions