[aclgraph] implement NPUPiecewiseBackend to enable aclgraph #836


Draft: wants to merge 1 commit into main

Conversation

MengqingCao
Collaborator

What this PR does / why we need it?

Implement NPUPiecewiseBackend to enable aclgraph.
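
At a high level, the piecewise backend splits the torch.compile FX graph at the attention ops listed in splitting_ops (see the compilation config in the log below), captures the remaining pieces as ACL graphs for a fixed set of batch sizes, and replays a captured graph whenever an incoming batch matches one of those sizes, falling back to eager execution otherwise. Below is a minimal sketch of that per-size dispatch idea; the class and argument names are illustrative only and are not the actual NPUPiecewiseBackend API added by this PR:

```python
from typing import Callable, Dict, Sequence

import torch


class PiecewiseRunner:
    """Illustrative sketch of per-batch-size graph dispatch.

    Not the actual NPUPiecewiseBackend: the real backend wraps the FX graph
    pieces produced by torch.compile and captures/replays ACL graphs, while
    here capture_fn is just an opaque callable standing in for that step.
    """

    def __init__(
        self,
        eager_fn: Callable[[torch.Tensor], torch.Tensor],
        capture_fn: Callable[[int], Callable[[torch.Tensor], torch.Tensor]],
        capture_sizes: Sequence[int],
    ) -> None:
        self.eager_fn = eager_fn      # plain (uncaptured) execution of the graph piece
        self.capture_fn = capture_fn  # captures one graph for a given batch size
        self.capture_sizes = sorted(capture_sizes)
        self.captured: Dict[int, Callable[[torch.Tensor], torch.Tensor]] = {}

    def warmup(self) -> None:
        # Capture one graph per configured batch size; the
        # "cudagraph_capture_sizes" list in the engine config below plays
        # this role for ACL graphs.
        for size in self.capture_sizes:
            self.captured[size] = self.capture_fn(size)

    def __call__(self, x: torch.Tensor) -> torch.Tensor:
        replay = self.captured.get(x.shape[0])
        if replay is not None:
            return replay(x)      # replay the pre-captured graph
        return self.eager_fn(x)   # fall back to eager for uncaptured shapes
```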

How was this patch tested?

Tested locally, because aclgraph cannot be enabled by default yet. The full run log follows.
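
For reference, a minimal sketch of the kind of offline-inference script used for the local test; the model name and prompts are taken from the log below, while the sampling parameters and max_tokens are assumptions and may differ from the actual examples/offline_inference_npu.py:

```python
import os

# Enable the V1 engine so the piecewise / ACL graph path is exercised.
os.environ["VLLM_USE_V1"] = "1"

from vllm import LLM, SamplingParams

prompts = [
    "Hello, my name is",
    "The president of the United States is",
    "The capital of France is",
    "The future of AI is",
]
# Sampling settings are illustrative; the example script may use different values.
sampling_params = SamplingParams(temperature=0.0, max_tokens=100)

# enforce_eager is left at its default (False) so the model is compiled
# and ACL graphs are captured during warmup.
llm = LLM(model="Qwen/Qwen2.5-0.5B-Instruct")

outputs = llm.generate(prompts, sampling_params)
for output in outputs:
    print(f"Prompt: {output.prompt!r}, Generated text: {output.outputs[0].text!r}")
```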

(atb) xxx@xxx-docker:~/code/vllm-ascend$ VLLM_USE_V1=1 python examples/offline_inference_npu.py 
INFO 05-13 12:53:12 [importing.py:16] Triton not installed or not compatible; certain GPU-related functions will not be available.
WARNING 05-13 12:53:12 [importing.py:28] Triton is not installed. Using dummy decorators. Install it via `pip install triton` to enable kernel compilation.
INFO 05-13 12:53:15 [__init__.py:30] Available plugins for group vllm.platform_plugins:
INFO 05-13 12:53:15 [__init__.py:32] name=ascend, value=vllm_ascend:register
INFO 05-13 12:53:15 [__init__.py:32] name=ascend, value=vllm_ascend:register
INFO 05-13 12:53:15 [__init__.py:34] all available plugins for group vllm.platform_plugins will be loaded.
INFO 05-13 12:53:15 [__init__.py:36] set environment variable VLLM_PLUGINS to control which plugins to load.
INFO 05-13 12:53:15 [__init__.py:44] plugin ascend loaded.
INFO 05-13 12:53:15 [__init__.py:44] plugin ascend loaded.
INFO 05-13 12:53:15 [__init__.py:239] Platform plugin ascend is activated
WARNING 05-13 12:53:18 [_custom_ops.py:21] Failed to import from vllm._C with ImportError('libnuma.so.1: cannot open shared object file: No such file or directory')
INFO 05-13 12:53:21 [__init__.py:30] Available plugins for group vllm.general_plugins:
INFO 05-13 12:53:21 [__init__.py:32] name=ascend_enhanced_model, value=vllm_ascend:register_model
INFO 05-13 12:53:21 [__init__.py:32] name=ascend_enhanced_model, value=vllm_ascend:register_model
INFO 05-13 12:53:21 [__init__.py:34] all available plugins for group vllm.general_plugins will be loaded.
INFO 05-13 12:53:21 [__init__.py:36] set environment variable VLLM_PLUGINS to control which plugins to load.
INFO 05-13 12:53:21 [__init__.py:44] plugin ascend_enhanced_model loaded.
INFO 05-13 12:53:21 [__init__.py:44] plugin ascend_enhanced_model loaded.
WARNING 05-13 12:53:21 [registry.py:393] Model architecture DeepSeekMTPModel is already registered, and will be overwritten by the new model class vllm_ascend.models.deepseek_mtp:CustomDeepSeekMTP.
WARNING 05-13 12:53:21 [registry.py:393] Model architecture Qwen2VLForConditionalGeneration is already registered, and will be overwritten by the new model class vllm_ascend.models.qwen2_vl:AscendQwen2VLForConditionalGeneration.
WARNING 05-13 12:53:21 [registry.py:393] Model architecture Qwen2_5_VLForConditionalGeneration is already registered, and will be overwritten by the new model class vllm_ascend.models.qwen2_5_vl:AscendQwen2_5_VLForConditionalGeneration.
WARNING 05-13 12:53:21 [registry.py:393] Model architecture DeepseekV2ForCausalLM is already registered, and will be overwritten by the new model class vllm_ascend.models.deepseek_v2:CustomDeepseekV2ForCausalLM.
WARNING 05-13 12:53:21 [registry.py:393] Model architecture DeepseekV3ForCausalLM is already registered, and will be overwritten by the new model class vllm_ascend.models.deepseek_v2:CustomDeepseekV3ForCausalLM.
INFO 05-13 12:53:41 [config.py:761] This model supports multiple tasks: {'score', 'reward', 'embed', 'generate', 'classify'}. Defaulting to 'generate'.
WARNING 05-13 12:53:41 [arg_utils.py:1546] Detected VLLM_USE_V1=1 with npu. Usage should be considered experimental. Please report any issues on Github.
INFO 05-13 12:53:41 [config.py:1859] Disabled the custom all-reduce kernel because it is not supported on current platform.
INFO 05-13 12:53:41 [config.py:2068] Chunked prefill is enabled with max_num_batched_tokens=8192.
INFO 05-13 12:53:41 [platform.py:142] PIECEWISE compilation enabled on NPU. use_inductor not supported - using only ACL Graph mode
INFO 05-13 12:53:41 [utils.py:140] Calculated maximum supported batch sizes for ACL graph: 76
INFO 05-13 12:53:41 [utils.py:166] No adjustment needed for ACL graph batch sizes: Qwen2ForCausalLM model (layers: 24) with 67 sizes
INFO 05-13 12:53:43 [core.py:61] Initializing a V1 LLM engine (v0.8.5.dev545+g376786fac.d20250509) with config: model='Qwen/Qwen2.5-0.5B-Instruct', speculative_config=None, tokenizer='Qwen/Qwen2.5-0.5B-Instruct', skip_tokenizer_init=False, tokenizer_mode=auto, revision=None, override_neuron_config={}, tokenizer_revision=None, trust_remote_code=False, dtype=torch.bfloat16, max_seq_len=32768, download_dir=None, load_format=auto, tensor_parallel_size=1, pipeline_parallel_size=1, disable_custom_all_reduce=True, quantization=None, enforce_eager=False, kv_cache_dtype=auto,  device_config=npu, decoding_config=DecodingConfig(backend='auto', disable_fallback=False, disable_any_whitespace=False, disable_additional_properties=False, reasoning_backend=''), observability_config=ObservabilityConfig(show_hidden_metrics_for_version=None, otlp_traces_endpoint=None, collect_detailed_traces=None), seed=None, served_model_name=Qwen/Qwen2.5-0.5B-Instruct, num_scheduler_steps=1, multi_step_stream_outputs=True, enable_prefix_caching=True, chunked_prefill_enabled=True, use_async_output_proc=True, pooler_config=None, compilation_config={"level": 3, "custom_ops": ["all"], "splitting_ops": ["vllm.unified_attention", "vllm.unified_attention_with_output", "vllm.unified_ascend_attention_with_output"], "use_inductor": false, "compile_sizes": [], "use_cudagraph": true, "cudagraph_num_of_warmups": 1, "cudagraph_capture_sizes": [512, 504, 496, 488, 480, 472, 464, 456, 448, 440, 432, 424, 416, 408, 400, 392, 384, 376, 368, 360, 352, 344, 336, 328, 320, 312, 304, 296, 288, 280, 272, 264, 256, 248, 240, 232, 224, 216, 208, 200, 192, 184, 176, 168, 160, 152, 144, 136, 128, 120, 112, 104, 96, 88, 80, 72, 64, 56, 48, 40, 32, 24, 16, 8, 4, 2, 1], "max_capture_size": 512}
WARNING 05-13 12:53:44 [utils.py:2595] Methods add_lora,cache_config,determine_available_memory,determine_num_available_blocks,device_config,get_cache_block_size_bytes,list_loras,load_config,pin_lora,remove_lora,scheduler_config not implemented in <vllm_ascend.worker.worker_v1.NPUWorker object at 0xfffcf81373d0>
INFO 05-13 12:53:53 [parallel_state.py:1004] rank 0 in world size 1 is assigned as DP rank 0, PP rank 0, TP rank 0
INFO 05-13 12:53:54 [model_runner_v1.py:936] Starting to load model Qwen/Qwen2.5-0.5B-Instruct...
INFO 05-13 12:53:56 [backends.py:41] Using EagerAdaptor
INFO 05-13 12:53:58 [weight_utils.py:257] Using model weights format ['*.safetensors']
INFO 05-13 12:53:58 [weight_utils.py:307] No model.safetensors.index.json found in remote.
Loading safetensors checkpoint shards:   0% Completed | 0/1 [00:00<?, ?it/s]
Loading safetensors checkpoint shards: 100% Completed | 1/1 [00:00<00:00,  5.00it/s]
Loading safetensors checkpoint shards: 100% Completed | 1/1 [00:00<00:00,  4.99it/s]

INFO 05-13 12:53:59 [default_loader.py:278] Loading weights took 0.28 seconds
INFO 05-13 12:53:59 [model_runner_v1.py:942] Loading model weights took 0.9281 GB
INFO 05-13 12:54:05 [backends.py:461] Using cache directory: /home/xxx/.cache/vllm/torch_compile_cache/08b3ae930b/rank_0_0 for vLLM's torch.compile
INFO 05-13 12:54:05 [backends.py:471] Dynamo bytecode transform time: 5.42 s
INFO 05-13 12:54:07 [backends.py:173] Compiling a graph for general shape takes 1.44 s
INFO 05-13 12:54:14 [monitor.py:33] torch.compile takes 6.86 s in total
INFO 05-13 12:54:15 [worker_v1.py:165] Available memory: 57321098444.8, total memory: 65464696832
INFO 05-13 12:54:15 [kv_cache_utils.py:639] GPU KV cache size: 4,664,704 tokens
INFO 05-13 12:54:15 [kv_cache_utils.py:642] Maximum concurrency for 32,768 tokens per request: 142.36x
INFO 05-13 12:55:01 [model_runner_v1.py:1097] Graph capturing finished in 46 secs, took 0.14 GiB
INFO 05-13 12:55:01 [core.py:163] init engine (profile, create kv cache, warmup model) took 61.73 seconds
INFO 05-13 12:55:01 [core_client.py:442] Core engine process 0 ready.
Adding requests: 100%|███████████████████████████████████████████████████| 4/4 [00:00<00:00, 193.29it/s]
Processed prompts: 100%|█| 4/4 [00:01<00:00,  2.74it/s, est. speed input: 15.08 toks/s, output: 274.14 t
Prompt: 'Hello, my name is', Generated text: ' Alex and I am a 17 year old male. I have been diagnosed with a rare genetic disorder called X-linked recessive. I have been told that I will not be able to have children. I have been told that I will not be able to have children because of the gene that I have. I have been told that I will not be able to have children because of the gene that I have. I have been told that I will not be able to have children because of the gene'
Prompt: 'The president of the United States is', Generated text: ' a very important person. He is the leader of the country. He is the leader of the country. He is the leader of the country. He is the leader of the country. He is the leader of the country. He is the leader of the country. He is the leader of the country. He is the leader of the country. He is the leader of the country. He is the leader of the country. He is the leader of the country. He is the leader of the country'
Prompt: 'The capital of France is', Generated text: ' Paris. It is the largest city in Europe and the second largest city in the world. It is located in the south of France, on the banks of the Seine River. It is situated on the Île de la Cité, which is a small island in the center of the city. The city is surrounded by the Seine River, which flows through the city. The city is also surrounded by the Pyrenees mountains, which are located to the north of the city. The city'
Prompt: 'The future of AI is', Generated text: ' in the hands of the people. The future of AI is in the hands of the people. The future of AI is in the hands of the people. The future of AI is in the hands of the people. The future of AI is in the hands of the people. The future of AI is in the hands of the people. The future of AI is in the hands of the people. The future of AI is in the hands of the people. The future of AI is in the hands of'
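
A side note on the sizes reported above: the cudagraph_capture_sizes list in the engine config appears to follow vLLM's usual pattern of 1, 2, 4 and then multiples of 8 up to max_capture_size, which is where the "67 sizes" figure in the ACL graph log line comes from. A quick, purely illustrative way to reproduce the list shown in the log:

```python
# Reproduce the capture-size list from the engine config above (illustrative only).
max_capture_size = 512
sizes = sorted([1, 2, 4] + list(range(8, max_capture_size + 1, 8)), reverse=True)
print(sizes)       # [512, 504, ..., 8, 4, 2, 1] as in cudagraph_capture_sizes
print(len(sizes))  # 67, matching "with 67 sizes" in the ACL graph batch-size log line
```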

Signed-off-by: MengqingCao <cmq0113@163.com>