INFO: Found overflow. Skip step #5136
Unanswered
stephencurry-web
asked this question in
Community | Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I trained Llama2-7B chat on the Alpaca dataset, and when I set the batch size to 2 or 4, INFO: Found overflow appeared at each step of the entire training process Skip step, And the gradient is nan, which is normal when I set the batch size to 1. May I ask what the reason is?
Beta Was this translation helpful? Give feedback.
All reactions