1 parent 61ef5cf commit f7084fc
torchtitan/experiments/llama4/infra/parallelize.py
@@ -118,7 +118,7 @@ def parallelize_llama(
         )
 
     # for MoE auxiliary-loss-free load balancing
-    if parallel_dims.dp_cp_enabled is not None:
+    if parallel_dims.dp_cp_enabled:
         # NOTE: Currently this sync is blocking (thus exposed) and happens on the
         # default compute stream. Need to assess if this is OK performance-wise.
         dp_cp_mesh = world_mesh["dp_cp"]
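
The change above swaps an identity check for a truthiness check. A minimal sketch of why that matters, assuming `dp_cp_enabled` is a boolean property: a `bool` is never `None`, so the old guard would pass even when no `dp_cp` mesh dimension is in use. `FakeParallelDims` and its fields are hypothetical stand-ins for illustration, not torchtitan's actual `ParallelDims` class.

```python
from dataclasses import dataclass


@dataclass
class FakeParallelDims:
    """Hypothetical stand-in for illustration; not torchtitan's ParallelDims."""

    dp_replicate: int = 1
    dp_shard: int = 1
    cp: int = 1

    @property
    def dp_cp_enabled(self) -> bool:
        # Assumption: the flag is True when any data-parallel or
        # context-parallel degree is greater than 1.
        return self.dp_replicate > 1 or self.dp_shard > 1 or self.cp > 1


def old_guard(dims: FakeParallelDims) -> bool:
    # Identity check: a bool (True or False) is never None,
    # so this is True for every configuration.
    return dims.dp_cp_enabled is not None


def new_guard(dims: FakeParallelDims) -> bool:
    # Truthiness check: True only when the flag itself is True.
    return bool(dims.dp_cp_enabled)


single_device = FakeParallelDims()
sharded = FakeParallelDims(dp_shard=8)

print(old_guard(single_device), new_guard(single_device))
print(old_guard(sharded), new_guard(sharded))
```

Under these assumptions, the old guard enters the branch for `single_device` as well, where a lookup such as `world_mesh["dp_cp"]` would presumably have no matching mesh dimension; the new guard skips it.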