Running the GPT2 example raises RuntimeError: Could not find 'SLURM_PROCID'. Is a SLURM environment required? #1371
Unanswered
ZXM1063694570
asked this question in
Community | Q&A
Replies: 1 comment
-
Adding '--from_torch' after train_gpt.py would be helpful.
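For context on why the flag might help: the error message suggests the script takes a SLURM-based launch path by default and reads 'SLURM_PROCID' from the environment, while '--from_torch' presumably switches to a launcher that reads the variables exported by torchrun / torch.distributed.launch. The sketch below only illustrates that distinction. The helper names `missing_vars` and `pick_launcher` are hypothetical, and the SLURM variable list is an assumption based on the error message rather than on ColossalAI's source.

```python
import os

# Assumption: the SLURM launch path needs these (SLURM_PROCID comes from the
# error message; SLURM_NPROCS is assumed, not confirmed in this thread).
SLURM_VARS = ("SLURM_PROCID", "SLURM_NPROCS")
# Variables that torchrun exports for every worker process.
TORCH_VARS = ("RANK", "LOCAL_RANK", "WORLD_SIZE", "MASTER_ADDR", "MASTER_PORT")


def missing_vars(required, env):
    """Return the names in `required` that are absent from `env`."""
    return [name for name in required if name not in env]


def pick_launcher(env):
    """Return "torch" or "slurm" depending on which variable set is complete.

    Hypothetical helper: it mirrors the choice that --from_torch makes
    explicitly, but it is not part of ColossalAI.
    """
    torch_missing = missing_vars(TORCH_VARS, env)
    if not torch_missing:
        return "torch"
    slurm_missing = missing_vars(SLURM_VARS, env)
    if not slurm_missing:
        return "slurm"
    raise RuntimeError(
        f"No launcher variables found; missing SLURM: {slurm_missing}, "
        f"torch: {torch_missing}"
    )


if __name__ == "__main__":
    try:
        print("launcher:", pick_launcher(os.environ))
    except RuntimeError as err:
        print(err)
```

In a plain Docker shell without SLURM and without torchrun, neither variable set is present, so `pick_launcher(os.environ)` raises, which mirrors the reported error. Inside a process started by torchrun it should return "torch".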
-
I used the image provided on Docker Hub, version 0.1.7, but running the GPT example raises RuntimeError: Could not find 'SLURM_PROCID'. The same error occurs with the 0.1.8 image.



This is my run script:
Switching gpt2_configs to other configs produces the same error.
Environment:
docker pull hpcaitech/colossalai:0.1.7 (also tried 0.1.8)
pip install transformers
pip install titans
8 A100 GPUs