We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 3b13697 commit 245a2baCopy full SHA for 245a2ba
torchtitan/experiments/llama4/infra/parallelize.py
@@ -118,7 +118,7 @@ def parallelize_llama(
118
)
119
120
# for MoE auxiliary-loss-free load balancing
121
- if parallel_dims.dp_cp_enabled is not None:
+ if parallel_dims.dp_cp_enabled:
122
# NOTE: Currently this sync is blocking (thus exposed) and happens on the
123
# default compute stream. Need to assess if this is OK performance-wise.
124
dp_cp_mesh = world_mesh["dp_cp"]
0 commit comments