Closed
lapp0/outlines
#92
Labels
enhancement, help wanted, optimization (Related to performance optimizations), structured generation (Linked to structured generation)
Description
What behavior of the library made you think about the improvement?
The current structured generation code creates a copy of the logits array filled with -inf, then sets the entries at the allowed token IDs to the corresponding values from the original logits array. See here.
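For illustration, the copy-based masking described above can be sketched roughly as follows. This is a minimal NumPy sketch, not the library's actual code: the function name `mask_by_copy` and the example values are hypothetical, and the real implementation may operate on a different array type.

```python
import numpy as np


def mask_by_copy(logits: np.ndarray, allowed_token_ids: np.ndarray) -> np.ndarray:
    """Hypothetical helper: return a NEW array that is -inf everywhere
    except at the allowed token IDs, where the original logits are kept."""
    # Allocates a second array with the same shape and dtype as `logits`.
    masked = np.full_like(logits, -np.inf)
    # Copy the original values back in at the allowed positions only.
    masked[allowed_token_ids] = logits[allowed_token_ids]
    return masked


logits = np.array([0.5, 1.5, -0.25, 2.0], dtype=np.float32)
allowed = np.array([1, 3])
masked = mask_by_copy(logits, allowed)
```

The cost this issue points at is the extra vocabulary-sized allocation made by `np.full_like` on every call.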
How would you like it to behave?
When possible, the original logits array should be updated in place, avoiding the creation of a new array entirely. This change would likely require the set of disallowed token IDs instead of the allowed ones, so that their entries can be set to -inf directly.
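A minimal sketch of the requested behavior, again in NumPy and under the assumption that the disallowed token IDs are available. Both `mask_in_place` and `disallowed_from_allowed` are hypothetical names used only for this example.

```python
import numpy as np


def mask_in_place(logits: np.ndarray, disallowed_token_ids: np.ndarray) -> np.ndarray:
    """Hypothetical helper: overwrite the disallowed entries with -inf
    directly in `logits`, without allocating a second logits array."""
    logits[disallowed_token_ids] = -np.inf
    return logits  # same object that was passed in


def disallowed_from_allowed(vocab_size: int, allowed_token_ids: np.ndarray) -> np.ndarray:
    """Hypothetical helper: derive the disallowed IDs from the allowed ones.
    This allocates a boolean mask of size `vocab_size`, so it would
    presumably be computed once and reused rather than on every step."""
    keep = np.zeros(vocab_size, dtype=bool)
    keep[allowed_token_ids] = True
    return np.flatnonzero(~keep)


logits = np.array([0.5, 1.5, -0.25, 2.0], dtype=np.float32)
disallowed = disallowed_from_allowed(logits.shape[0], np.array([1, 3]))
result = mask_in_place(logits, disallowed)
```

Note that the in-place variant destroys the original values at the disallowed positions, so it only fits callers that no longer need the unmasked logits.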