-
Notifications
You must be signed in to change notification settings - Fork 379
Pull requests: pytorch/torchtitan
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Support gradient accumulation
CLA Signed
This label is managed by the Meta Open Source bot.
#1238
opened May 29, 2025 by
janEbert
Loading…
Implement initial_load_path for checkpointer
CLA Signed
This label is managed by the Meta Open Source bot.
#1236
opened May 28, 2025 by
fegin
Loading…
[WIP][deepseek] update deepseek to real training loop, part 1
CLA Signed
This label is managed by the Meta Open Source bot.
#1233
opened May 28, 2025 by
lessw2020
Loading…
[SimpleFSDP] Add CI for SimpleFSDP
CLA Signed
This label is managed by the Meta Open Source bot.
#1231
opened May 28, 2025 by
ruisizhang123
Loading…
[cp][flex_attention] integration test trial
CLA Signed
This label is managed by the Meta Open Source bot.
[Flux] Add batched inference
CLA Signed
This label is managed by the Meta Open Source bot.
#1227
opened May 27, 2025 by
CarlosGomes98
Loading…
[WIP] Implement the feature to save unsharded weights at the last step
CLA Signed
This label is managed by the Meta Open Source bot.
#1219
opened May 23, 2025 by
fegin
Loading…
[WIP][Experimental] Activation Offloading
CLA Signed
This label is managed by the Meta Open Source bot.
#1218
opened May 23, 2025 by
lessw2020
Loading…
[WIP][DeepSeek] DeepSeek training and component integration with Titan main components
CLA Signed
This label is managed by the Meta Open Source bot.
#1183
opened May 13, 2025 by
lessw2020
Loading…
compile: turn off fullgraph=True to support llama4
CLA Signed
This label is managed by the Meta Open Source bot.
#1182
opened May 12, 2025 by
bdhirsh
Loading…
[cp][flex_attention] integration test trial
CLA Signed
This label is managed by the Meta Open Source bot.
module: context parallel
[WIP] float8 rowwise all gather
CLA Signed
This label is managed by the Meta Open Source bot.
#1157
opened Apr 30, 2025 by
danielvegamyhre
•
Draft
[WIP] token-expert assignments and layer affinity tracking for expert placement via ILP solving
CLA Signed
This label is managed by the Meta Open Source bot.
#1152
opened Apr 28, 2025 by
lessw2020
Loading…
Add This label is managed by the Meta Open Source bot.
grad_norm
metrics
CLA Signed
#1143
opened Apr 25, 2025 by
yzhangcs
Loading…
Enable save plan caching
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
#1140
opened Apr 23, 2025 by
MeetVadakkanchery
Loading…
[WIP] Llama4 Vision Encoder
CLA Signed
This label is managed by the Meta Open Source bot.
#1116
opened Apr 17, 2025 by
pbontrager
Loading…
[WIP]Implement llama4 HF format to DCP converter
CLA Signed
This label is managed by the Meta Open Source bot.
#1104
opened Apr 15, 2025 by
fegin
Loading…
improve reshard_after_forward logic
CLA Signed
This label is managed by the Meta Open Source bot.
#1094
opened Apr 11, 2025 by
tianyu-l
Loading…
[CI] Re-enable async TP test
CLA Signed
This label is managed by the Meta Open Source bot.
#1090
opened Apr 11, 2025 by
kwen2501
Loading…
[Fux] load AutoencoderKL from diffusers
CLA Signed
This label is managed by the Meta Open Source bot.
#1085
opened Apr 10, 2025 by
kashif
Loading…
[DeepSeek][kernels] index select permute, cuda
CLA Signed
This label is managed by the Meta Open Source bot.
#1083
opened Apr 9, 2025 by
lessw2020
Loading…
[DeepSeek][Kernels] MoE sorting - Scatter Gather kernels
CLA Signed
This label is managed by the Meta Open Source bot.
#1065
opened Apr 7, 2025 by
lessw2020
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.