
Pull requests: PaddlePaddle/PaddleNLP

Pull requests list

[LLM] fix moe using on tensor parallelism
#11099 opened Sep 18, 2025 by KB-Ding
update post norm rc
#11097 opened Sep 17, 2025 by chen2016013
2 tasks
Fix FusedRMSLinear backward compute
#11095 opened Sep 17, 2025 by lshpku
update using_post_norm_recompute
#11093 opened Sep 16, 2025 by chen2016013
2 tasks
Adapter flex checkpoint
#11091 opened Sep 15, 2025 by xingmingyyj
2 tasks
add keti7 scripts
#11086 opened Sep 11, 2025 by FeixLiu
2 tasks
Support uc save for deepseek
#11078 opened Sep 6, 2025 by DesmonDay
Pr adapt flex checkpoint (label: contributor)
#11065 opened Sep 3, 2025 by zty-king
Implement HuggingFace Cache and reduced-expert loading
#11042 opened Sep 2, 2025 by lshpku
fix typos (label: contributor)
#11041 opened Sep 2, 2025 by co63oc
2 tasks
clip expert grad
#11035 opened Sep 1, 2025 by zhangbo9674
2 tasks
fix_fa3_mem
#11033 opened Sep 1, 2025 by liuruyan
2 tasks
Implement auxiliary-loss-free load balancing
#11031 opened Aug 29, 2025 by lshpku
Add support for DRAG (label: contributor)
#11021 opened Aug 27, 2025 by Kinandra
Try import FlexCheckpoint
#11019 opened Aug 27, 2025 by xingmingyyj
2 tasks
opt reader and gather
#11016 opened Aug 26, 2025 by phlrain
2 tasks
Optimize moe and dense overlap
#11013 opened Aug 26, 2025 by phlrain
2 tasks
[NOT MERGE] Pr adapt flex checkpoint (label: contributor)
#10996 opened Aug 25, 2025 by zty-king
2 tasks
Make pp_stream wait on attn_backward_dx
#10984 opened Aug 21, 2025 by lshpku
recompute support offload tensor
#10981 opened Aug 21, 2025 by blacksheep-Aristotle
2 tasks
moe_layer support fine_grained_forward
#10980 opened Aug 21, 2025 by blacksheep-Aristotle
2 tasks
update expert parallel init logic
#10966 opened Aug 18, 2025 by blacksheep-Aristotle
2 tasks
optimize mtp speed
#10965 opened Aug 18, 2025 by phlrain
2 tasks