Skip to content

Upstream 091 eplb dynamic #1663

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 195 commits into from
Closed

Conversation

shiyuan680
Copy link

@shiyuan680 shiyuan680 commented Jul 8, 2025

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

NPUDev and others added 30 commits June 10, 2025 20:36
add eplb policy and updator
implementation of VllmEplbAdaptor and D2DExpertWeightLoader
determine num_dense_layers and num_moe_layers by refering to model co…
fix bugs in fused_experts_with_all2all
add eplb tabel generator
wanghanqingLYT and others added 22 commits June 25, 2025 14:52
fix bug when running benchmark by move forward_before behind return o…
collect moe load after dispatch
modify serialization of eplb process
improve d2d expert weight update impl in eplb_updator.py
add function take_update_info_from_eplb_process
Copy link

github-actions bot commented Jul 8, 2025

This pull request has conflicts, please resolve those before we can evaluate the pull request.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants