Skip to content

[0.9.1][PD][Perf] Avoid performing cpu all_reduce in disaggregated-prefill scenario. #1644

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: v0.9.1-dev
Choose a base branch
from

Conversation

whx-sjtu
Copy link
Contributor

@whx-sjtu whx-sjtu commented Jul 7, 2025

Avoid performing cpu all_reduce in disaggregated-prefill scenario for better performance.

@whx-sjtu whx-sjtu changed the base branch from main to v0.9.1-dev July 7, 2025 05:08
@whx-sjtu whx-sjtu force-pushed the remove_pd_allreduce branch from f55250a to b89e1da Compare July 7, 2025 05:17
Signed-off-by: whx-sjtu <2952154980@qq.com>
@whx-sjtu whx-sjtu force-pushed the remove_pd_allreduce branch from b89e1da to 796d9a1 Compare July 7, 2025 08:19
@wangxiyuan wangxiyuan changed the title [PD][Perf] Avoid performing cpu all_reduce in disaggregated-prefill scenario. [0.9.1][PD][Perf] Avoid performing cpu all_reduce in disaggregated-prefill scenario. Jul 7, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant