Pull requests: pytorch/torchtitan

Pull requests list

Support gradient accumulation (labels: CLA Signed)
#1238 opened May 29, 2025 by janEbert
Implement initial_load_path for checkpointer (labels: CLA Signed)
#1236 opened May 28, 2025 by fegin
Add h100 test (labels: CLA Signed)
#1235 opened May 28, 2025 by mori360 · Draft
[WIP][deepseek] update deepseek to real training loop, part 1 (labels: CLA Signed)
#1233 opened May 28, 2025 by lessw2020
[SimpleFSDP] Add CI for SimpleFSDP (labels: CLA Signed)
#1231 opened May 28, 2025 by ruisizhang123
[cp][flex_attention] integration test trial (labels: CLA Signed)
#1228 opened May 27, 2025 by XilunWu · Draft
[Flux] Add batched inference (labels: CLA Signed)
#1227 opened May 27, 2025 by CarlosGomes98
[WIP] Implement the feature to save unsharded weights at the last step (labels: CLA Signed)
#1219 opened May 23, 2025 by fegin
[WIP][Experimental] Activation Offloading (labels: CLA Signed)
#1218 opened May 23, 2025 by lessw2020
Async TP integration test (labels: CLA Signed)
#1193 opened May 14, 2025 by fegin · Draft
[WIP][DeepSeek] DeepSeek training and component integration with Titan main components (labels: CLA Signed)
#1183 opened May 13, 2025 by lessw2020
compile: turn off fullgraph=True to support llama4 (labels: CLA Signed)
#1182 opened May 12, 2025 by bdhirsh
🐛 Use correct path for train_configs
#1163 opened May 2, 2025 by brianlechthaler
[cp][flex_attention] integration test trial (labels: CLA Signed, module: context parallel)
#1160 opened May 1, 2025 by XilunWu · Draft
[WIP] float8 rowwise all gather (labels: CLA Signed)
#1157 opened Apr 30, 2025 by danielvegamyhre · Draft
[WIP] token-expert assignments and layer affinity tracking for expert placement via ILP solving (labels: CLA Signed)
#1152 opened Apr 28, 2025 by lessw2020
Add grad_norm metrics (labels: CLA Signed)
#1143 opened Apr 25, 2025 by yzhangcs
Enable save plan caching (labels: CLA Signed, fb-exported)
#1140 opened Apr 23, 2025 by MeetVadakkanchery
[WIP] Llama4 Vision Encoder (labels: CLA Signed)
#1116 opened Apr 17, 2025 by pbontrager
[WIP]Implement llama4 HF format to DCP converter (labels: CLA Signed)
#1104 opened Apr 15, 2025 by fegin
improve reshard_after_forward logic (labels: CLA Signed)
#1094 opened Apr 11, 2025 by tianyu-l
[CI] Re-enable async TP test (labels: CLA Signed)
#1090 opened Apr 11, 2025 by kwen2501
[Fux] load AutoencoderKL from diffusers (labels: CLA Signed)
#1085 opened Apr 10, 2025 by kashif
[DeepSeek][kernels] index select permute, cuda (labels: CLA Signed)
#1083 opened Apr 9, 2025 by lessw2020
[DeepSeek][Kernels] MoE sorting - Scatter Gather kernels (labels: CLA Signed)
#1065 opened Apr 7, 2025 by lessw2020