How to combine ZeRO optimization with TP + PP #2181
Unanswered
yhcc
asked this question in
Community | Q&A
Replies: 1 comment 5 replies
-
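ZeRO2 is currently under development, and a ZeRO2 + PP example will be added later. ZeRO3 + PP is not recommended because it is relatively inefficient.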
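For a rough sense of why ZeRO3 + PP can be costly, here is a back-of-the-envelope sketch in Python. It is not ColossalAI code, and the function name `dp_comm_volume` is hypothetical. It assumes the per-step communication accounting from the ZeRO paper (about 2Ψ for stages 0 to 2 and about 3Ψ for stage 3, where Ψ is the number of parameters held by one pipeline stage), that gradients are accumulated locally and reduced once per step, and that a ZeRO3 stage re-gathers its sharded parameters in the forward and backward pass of every micro-batch. Real implementations may cache or prefetch parameters, so treat the numbers as illustrative only.

```python
def dp_comm_volume(num_params: int, zero_stage: int, micro_batches: int = 1) -> int:
    """Estimate per-step data-parallel traffic, counted in parameter-sized elements.

    Assumptions (not taken from the thread):
      * stages 0-2 move about 2 * num_params per step (gradient reduction
        plus gathering the updated parameters), independent of micro-batches;
      * stage 3 all-gathers the sharded parameters once in the forward pass
        and once in the backward pass of EVERY micro-batch, then
        reduce-scatters the accumulated gradients once.
    """
    if micro_batches < 1:
        raise ValueError("micro_batches must be >= 1")
    if zero_stage in (0, 1, 2):
        return 2 * num_params
    if zero_stage == 3:
        param_gathers = 2 * num_params * micro_batches  # forward + backward
        grad_reduce = num_params                        # once per step
        return param_gathers + grad_reduce
    raise ValueError(f"unknown ZeRO stage: {zero_stage}")


if __name__ == "__main__":
    psi = 1_000_000_000   # parameters in one pipeline stage (illustrative)
    m = 8                 # micro-batches per step (illustrative)
    zero2 = dp_comm_volume(psi, zero_stage=2, micro_batches=m)
    zero3 = dp_comm_volume(psi, zero_stage=3, micro_batches=m)
    print(zero3 / zero2)  # prints 8.5 under the assumptions above
```

Under these assumptions the ZeRO3 traffic grows linearly with the number of micro-batches, while the ZeRO2 traffic does not, which is one plausible reading of the "relatively inefficient" remark.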
-
Hello, according to #1839, it seems ZeroInitContext is no longer recommended and code should instead follow the approach in https://github.com/hpcaitech/ColossalAI/blob/main/examples/language/gpt/train_gpt_demo.py. However, as mentioned in https://colossalaiworkspace.slack.com/archives/C02NAJARJ9Y/p1665482115336439?thread_ts=1665444351.685429&cid=C02NAJARJ9Y, ZeroDDP has not been tested with PP, while https://github.com/hpcaitech/ColossalAI-Examples/blob/main/language/gpt/train_gpt.py has an example that uses ZeroInitContext with TP + PP. If ZeroInitContext is no longer recommended, is there an example that shows the updated approach?