Replies: 1 comment
-
Any chance you've figured this out since posting? Running into the same error in a similar setting. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Suppose I have a function thats calls
shard_map
, is it possible to parallaize it usingshard_map
(or by other means) over some unused axes?For example, consider
shard_loss_by_model
below, which shards the computation ofloss
over the 'model' axis. Is it possible to parallelize it over the 'batch' axis?Note that the naive implementation (see
nested
below) throws anAttributeError: 'RewriteTrace' object has no attribute 'process_shard_map'
exception.Beta Was this translation helpful? Give feedback.
All reactions