At the moment, LLM software seems to have a schism in the implementation of the temperature option, as I documented here: #3914
I realized that it might be best to modify it so that you can do two temperature 'passes':
An Input Temperature, which runs before any other sampler and rescales the original distribution before anything else modifies it
An Output Temperature, which comes last, after all the truncation samplers (such as Top K, Top P, etc.) have run
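To make the ordering concrete, here is a minimal Python sketch of the pipeline I have in mind. The function and parameter names (`sample_two_pass`, `input_temp`, `output_temp`) are hypothetical illustrations, not existing llama.cpp API, and Top K stands in for the whole truncation stage:

```python
import math
import random

def softmax(logits, temperature=1.0):
    """Convert raw logits to probabilities, dividing by temperature first."""
    scaled = [l / temperature for l in logits]
    m = max(scaled)
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def sample_two_pass(logits, input_temp=1.0, top_k=40, output_temp=1.0):
    """Hypothetical two-pass temperature pipeline:
    1. Input Temperature rescales the raw distribution.
    2. Truncation (Top K here) keeps only the best candidates.
    3. Output Temperature rescales the surviving candidates.
    """
    # Pass 1: input temperature on the full distribution.
    probs = softmax(logits, temperature=input_temp)

    # Truncation: keep the top_k most likely candidates.
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:top_k]

    # Pass 2: output temperature over the surviving candidates only.
    kept_logits = [math.log(probs[i]) for i in ranked]
    final_probs = softmax(kept_logits, temperature=output_temp)

    # Sample a token id from the final distribution.
    return random.choices(ranked, weights=final_probs, k=1)[0]
```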
The expected implementation of Temperature (as it is used in OpenAI's models and in other inference backends) is to modify the original distribution, so that truncation samplers such as Top P or Top K aren't strictly necessary. The current implementation in llama.cpp, however, behaves like my description of Output Temperature: it is applied after the truncation samplers.
This can be confusing, because truncation changes the output in a way that is very similar to lowering the temperature, except it explicitly cuts out bad choices rather than scaling the model's confidence. That approach isn't flawed, but we want interpretability in how the model responds to sampler changes, instead of people setting options they don't understand and getting a skewed, sometimes unnatural representation of what the model is actually predicting.
This would give users freedom because:
You can apply temperature after truncation has selected a set of high quality candidates, to 'randomize' among them without inviting low quality token choices; it simply helps the model avoid overly predictable outputs while staying in a safe range.
You can apply temperature before truncation if you want the raw probabilities to be a little less pre-determined overall before the unlikely candidates are cut out. (Both cases are illustrated in the usage sketch below.)
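Using the hypothetical `sample_two_pass` sketch above, the two use cases would look roughly like this (the logit values are toy numbers for illustration):

```python
logits = [2.0, 1.5, 0.3, -1.0, -2.5]  # toy logits for five tokens

# Post-truncation randomization: keep the raw ranking intact, then
# add variety only among the surviving high quality candidates.
token = sample_two_pass(logits, input_temp=1.0, top_k=3, output_temp=1.3)

# Pre-truncation flattening: make the raw probabilities less
# pre-determined before the unlikely candidates are cut out.
token = sample_two_pass(logits, input_temp=1.3, top_k=3, output_temp=1.0)
```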