examples : predicted output for text generation #14739
This adds an example that lets the user specify a predicted (expected) output, which is then used as a draft to speed up generation. The most prominent use case is making changes to an existing code block. OpenAI currently offers a similar feature, Predicted Outputs.
This obviously has a lot of overlap with speculative decoding and lookup decoding. I think the main difference is that it gives the user more direct control over the expected output. It will also try to pick the draft back up after a divergence once a few consecutive tokens match again, so in that sense it brings in some of the benefits of lookup decoding.
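The accept/resync behavior can be sketched roughly as follows. This is a simplified simulation, not the PR's actual implementation: the `generate_with_prediction` helper and the `resync` threshold are illustrative names, and the `target` list stands in for tokens the model would actually produce (in the real example, draft tokens are verified by the model in a batch).

```python
def generate_with_prediction(target, predicted, resync=3):
    """Simulate generation that uses a user-supplied prediction as a draft.

    Returns (output, accepted), where `accepted` counts tokens taken
    straight from the draft instead of being generated one at a time.
    """
    out, accepted = [], 0
    j = 0  # cursor into the predicted draft; None means the draft was lost
    for tok in target:
        if j is not None and j < len(predicted) and predicted[j] == tok:
            # Draft token verified: accept it and advance the cursor.
            out.append(tok)
            accepted += 1
            j += 1
            continue
        # Divergence: emit the token normally, then check whether the last
        # `resync` output tokens reappear contiguously in the draft, so the
        # draft can be picked back up from that point.
        out.append(tok)
        j = None
        tail = out[-resync:]
        if len(tail) == resync:
            for k in range(len(predicted) - resync + 1):
                if predicted[k:k + resync] == tail:
                    j = k + resync  # resume drafting just past the match
                    break
    return out, accepted

# Editing one region of a "file": most of the prediction is still usable.
predicted = ["a", "b", "c", "d", "e", "f", "g", "h"]
target    = ["a", "b", "X", "Y", "e", "f", "g", "h"]
out, accepted = generate_with_prediction(target, predicted, resync=2)
# out == target; 4 of the 8 tokens came straight from the prediction
```

With a pure speculative draft, the edit in the middle would end the usefulness of the prediction; the resync step is what lets the unchanged tail of the file still be accepted.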
I added some example scripts for testing code modification; these can compare the predicted-output approach against speculative and lookup decoding. I also included a script that uses the Osmosis-Apply-1.7B model, which is targeted directly at code patching.