add single request test case for aclgraph #3392
Conversation
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run the linting and testing checks locally according to Contributing and Testing.
Code Review
This pull request adds a new end-to-end test for aclgraph with data parallelism. The test is well-structured, but there's a critical issue in how the test server is initialized that will prevent the test from running. I've also identified a potential source of test flakiness due to a hardcoded port and provided a comprehensive suggestion to fix both issues, making the test more robust.
env_dict = {
    "TASK_QUEUE_ENABLE": "1",
    "HCCL_OP_EXPANSION_MODE": "AIV",
}
server_args = [
    "--no-enable-prefix-caching", "--tensor-parallel-size",
    "1", "--data-parallel-size", str(dp_size), "--port", "20002",
    "--trust-remote-code", "--gpu-memory-utilization", "0.9"
]
request_keyword_args: dict[str, Any] = {
    **api_keyword_args,
}
with RemoteOpenAIServer(model,
                        server_args,
                        server_port=20002,
                        env_dict=env_dict,
                        auto_port=False) as server:
There are a couple of issues with the RemoteOpenAIServer setup:

- Incorrect RemoteOpenAIServer call (Critical): The current call to RemoteOpenAIServer is incorrect. The __init__ signature in tests/e2e/conftest.py is (self, model: str, server_host: str, server_port: int, vllm_serve_args: list[str], ...). Your call provides server_args (a list) as the second positional argument for server_host (a string), and misses the required vllm_serve_args argument. This will cause a TypeError at runtime.
- Hardcoded port (High): The test uses a hardcoded port, 20002. This can cause flaky tests due to port conflicts when running tests in parallel or if the port is already occupied. It's better to use a dynamically allocated port.
Here is a suggested fix that addresses both issues: it uses vllm.utils.get_open_port to dynamically allocate a port and corrects the RemoteOpenAIServer call. The import is added inside the function for simplicity of the suggestion.
from vllm.utils import get_open_port
port = get_open_port()
env_dict = {
"TASK_QUEUE_ENABLE": "1",
"HCCL_OP_EXPANSION_MODE": "AIV",
}
server_args = [
"--no-enable-prefix-caching", "--tensor-parallel-size",
"1", "--data-parallel-size", str(dp_size), "--port", str(port),
"--trust-remote-code", "--gpu-memory-utilization", "0.9"
]
request_keyword_args: dict[str, Any] = {
**api_keyword_args,
}
with RemoteOpenAIServer(model=model,
server_host="localhost",
server_port=port,
vllm_serve_args=server_args,
env_dict=env_dict,
auto_port=False) as server:
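Note that auto_port=False is retained in this suggestion: since the port is now allocated explicitly via get_open_port() and passed consistently as both the --port server argument and the server_port keyword, the wrapper presumably has no need to pick a port on its own.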
@lilinsiman Please check these suggested changes.
After checking, the suggested modification is indeed better: calling the get_open_port function automatically returns an available port, ensuring the port is not already occupied and thus guaranteeing its availability.
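For context, a get_open_port-style helper is typically implemented by binding a socket to port 0 so the operating system picks a currently free ephemeral port. The sketch below illustrates the general technique; it is an assumption for illustration, not the actual body of vllm.utils.get_open_port.

import socket

def get_open_port_sketch() -> int:
    # Binding to port 0 asks the OS to assign any free ephemeral port.
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as s:
        s.bind(("", 0))
        # getsockname() reports the (host, port) pair the OS assigned.
        return s.getsockname()[1]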
Force-pushed from d6afb6e to 279e6db
Please enable this test in the .github workflow.
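Enabling the test would presumably mean adding its path to the e2e test job in the repository's .github/workflows configuration so it runs in CI.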
@pytest.mark.asyncio
@pytest.mark.parametrize("model", MODELS)
@pytest.mark.parametrize("dp_size", DATA_PARALLELS)
async def test_models(model: str, dp_size: int) -> None:
Change the test case name.
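For illustration, here is a hedged sketch of how the fragments in this PR might fit together once the case is renamed and the suggested server setup is applied. The model list, the renamed test, the conftest import path, and the get_async_client() usage are assumptions for the sketch, not the exact code under review.

import pytest
from vllm.utils import get_open_port

from tests.e2e.conftest import RemoteOpenAIServer  # assumed import path

MODELS = ["Qwen/Qwen2.5-0.5B-Instruct"]  # hypothetical model list
DATA_PARALLELS = [2]

@pytest.mark.asyncio
@pytest.mark.parametrize("model", MODELS)
@pytest.mark.parametrize("dp_size", DATA_PARALLELS)
async def test_aclgraph_single_request_dp(model: str, dp_size: int) -> None:
    # Allocate a free port dynamically to avoid conflicts in parallel runs.
    port = get_open_port()
    env_dict = {
        "TASK_QUEUE_ENABLE": "1",
        "HCCL_OP_EXPANSION_MODE": "AIV",
    }
    server_args = [
        "--no-enable-prefix-caching", "--tensor-parallel-size", "1",
        "--data-parallel-size", str(dp_size), "--port", str(port),
        "--trust-remote-code", "--gpu-memory-utilization", "0.9",
    ]
    with RemoteOpenAIServer(model=model,
                            server_host="localhost",
                            server_port=port,
                            vllm_serve_args=server_args,
                            env_dict=env_dict,
                            auto_port=False) as server:
        # get_async_client() is assumed to mirror vLLM's e2e helper API,
        # returning an AsyncOpenAI client pointed at the local server.
        client = server.get_async_client()
        completion = await client.completions.create(
            model=model, prompt="Hello, my name is", max_tokens=16)
        assert completion.choices[0].text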
server_args = [
    "--no-enable-prefix-caching", "--tensor-parallel-size", "1",
    "--data-parallel-size",
    str(dp_size), "--port", "20002", "--trust-remote-code",
Delete the hardcoded port code.
Force-pushed from 279e6db to e73fd45
Signed-off-by: lilinsiman <lilinsiman@gmail.com>
Force-pushed from e73fd45 to 407fb7a
What this PR does / why we need it?
This PR adds an online single-request DP2 test case for aclgraph.
Does this PR introduce any user-facing change?
No.
How was this patch tested?
UT.