Coati SFT Training Process stack in "[extension] Compiling or loading the JIT-built cpu_adam kernel during runtime now" #3468
Unanswered
linmou
asked this question in
Community | Q&A
Replies: 1 comment 1 reply
-
My problem has been resolved. The solution is to change the ColossalAI installation method from the original 'CUDA_EXT=1 pip install colossalai 'Replace with Download From Source,' CUDA '_ EXT=1 pip install .’. I hope it can help everyone |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
When I run Chat/examples/train_sft.sh several times, the process seems stacked somewhere.
No error reports, it just stops after showing "[extension] Compiling or loading the JIT-built cpu_adam kernel during runtime now". Any idea about why and where it stacks?
Beta Was this translation helpful? Give feedback.
All reactions