-
Notifications
You must be signed in to change notification settings - Fork 55
Open
Description
I see in quite a few examples, there is essentially a cot for doing math steps. It happens after the final answer is provided. This is much harder for the model to learn than having the cot come before the final answer. It also doesn't help with accuracy as much, because the influence of the final answer is already in the context window before it starts doing the steps for the calculation.
Metadata
Metadata
Assignees
Labels
No labels