Replies: 1 comment 1 reply
-
I met the same problem.. Hope someone can fix this |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi all. I try to do something like the following:
However, this gives me
File "/somepath/python3.8/site-packages/deepspeed/runtime/zero/stage3.py", line 1111, in __reduce_and_partition_ipg_grads assert len(set(p.ds_id for p in self.params_in_ipg_bucket)) == len(self.params_in_ipg_bucket) AssertionError
. Does anyone have an idea how to resolve this? I'm using deepspeed zero3 0.10.0 with gradient checkpointing. Thanks!Beta Was this translation helpful? Give feedback.
All reactions