Repeated line breaks when using JSON Logits Processor (vLLM) #715
LilianJim
started this conversation in
Weird model output
Replies: 1 comment 1 reply
-
The "workaround" is the solution to this; the default behavior of Outlines is to let the model choose the whitespace pattern but not all models seem to fare fine with that. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Describe the issue as clearly as possible:
When using a
JSONLogitsProcessor
to constrain the output, a bug that often occurs is the model generated line breaks (\n
) indefinitely.I don't think this is the expected behavior, and for now I've only witnessed it with Mixtral 7x8B
Steps/code to reproduce the bug:
reproduce.py
Expected result:
Error message:
Output of
python3 reproduce.py --model casperhansen/mixtral-instruct-awq --quantization awq
Outlines/Python version information:
Version information
Context for the issue:
Found a workaround by passing
whitespace_pattern=r"[\n ]?"
toJSONLogitsProcessor
, which seems to mitigate the issue. Despite this, the described default behavior is probably not wanted.Beta Was this translation helpful? Give feedback.
All reactions