[CI]Add e2e test for 310p #1879
base: main
Conversation
Codecov Report: All modified and coverable lines are covered by tests ✅

Additional details and impacted files:

@@ Coverage Diff @@
##            main    #1879   +/-  ##
=======================================
  Coverage   71.49%   71.49%
=======================================
  Files          86       86
  Lines        9131     9131
=======================================
  Hits         6528     6528
  Misses       2603     2603
LGTM if CI passes.
@@ -0,0 +1,51 @@
#
Do not create a new folder. Change it to something like:
tests/e2e/singlecard/test_offline_inference_310p.py
done
# See the License for the specific language governing permissions and
# limitations under the License.
# This file is a part of the vllm-ascend project.
# Adapted from vllm/tests/basic_correctness/test_basic_correctness.py
These 2 lines are unnecessary.
done
#
"""Compare the short outputs of the Pangu (Ascend) model when using greedy sampling.

Run `pytest tests/e2e/test_offline_inference.py`.
?
delete it
@@ -0,0 +1,117 @@
#
Why create a new workflow? You could just add two jobs, e2e-310p and e2e-4-cards-310p, in vllm_ascend_test like the others do.
The trigger conditions of this workflow are different from vllm_ascend_test, including its labels, schedule, and tags.
# TODO(yikun): Remove m.daocloud.io prefix when infra proxy ready
image: m.daocloud.io/quay.io/ascend/cann:8.1.rc1-310p-ubuntu22.04-py3.10
Change this according to: #1912
@pytest.mark.parametrize("model", MODELS) | ||
@pytest.mark.parametrize("dtype", ["float16"]) | ||
@pytest.mark.parametrize("max_tokens", [5]) | ||
def test_pangu_model(model: str, dtype: str, max_tokens: int) -> None: |
It's better to also add a torchair test case.
@Angazenn Please give some parameter examples here.
Sure, here is an example for Pangu with torchair:
additional_config = {
    "ascend_scheduler_config": {
        "enabled": True,
    },
    "torchair_graph_config": {
        "enabled": True,
    },
}
with VllmRunner(
        "vllm-ascend/pangu-pro-moe-pruing",
        dtype="half",
        tensor_parallel_size=4,
        distributed_executor_backend="mp",
        enforce_eager=False,
        additional_config=additional_config,
        enable_expert_parallel=True,
        compilation_config={
            "custom_ops": ["+unquantized_fused_moe"]
        },
) as vllm_model:
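The snippet above only shows the runner parameters. For reference, a minimal sketch of how they might be wrapped into a full pytest case; the import path, test name, prompt, and the generate_greedy helper are assumptions drawn from the existing e2e tests, not code from this PR:

import pytest

from tests.conftest import VllmRunner  # assumed location of the test runner helper


@pytest.mark.parametrize("max_tokens", [5])
def test_pangu_model_torchair(max_tokens: int) -> None:
    prompts = ["Hello, my name is"]
    additional_config = {
        "ascend_scheduler_config": {"enabled": True},
        "torchair_graph_config": {"enabled": True},
    }
    # tensor_parallel_size=4 needs four cards, so this case belongs in the
    # multi-card (e2e-4-cards-310p) job rather than the single-card suite.
    with VllmRunner(
            "vllm-ascend/pangu-pro-moe-pruing",
            dtype="half",
            tensor_parallel_size=4,
            distributed_executor_backend="mp",
            enforce_eager=False,
            additional_config=additional_config,
            enable_expert_parallel=True,
            compilation_config={"custom_ops": ["+unquantized_fused_moe"]},
    ) as vllm_model:
        # Greedy decoding keeps the output deterministic for comparison.
        vllm_model.generate_greedy(prompts, max_tokens)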
Signed-off-by: hfadzxy <starmoon_zhang@163.com>
What this PR does / why we need it?
Add e2e test for 310p:
- trigger conditions: tag, labels (ready-for-test, e2e-310p-test), schedule
- image: m.daocloud.io/quay.io/ascend/cann:8.1.rc1-310p-ubuntu22.04-py3.10
- runner: linux-aarch64-310p-1, linux-aarch64-310p-4
- models: IntervitensInc/pangu-pro-moe-model, Qwen/Qwen3-0.6B-Base, Qwen/Qwen2.5-7B-Instruct
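For illustration, a minimal sketch of what the single-card offline inference check for the Qwen models above could look like; the import path, test name, prompt, and generate_greedy helper are assumptions based on the existing e2e tests, not the exact contents of the new file:

import pytest

from tests.conftest import VllmRunner  # assumed location of the test runner helper

MODELS = [
    "Qwen/Qwen3-0.6B-Base",
    "Qwen/Qwen2.5-7B-Instruct",
]


@pytest.mark.parametrize("model", MODELS)
@pytest.mark.parametrize("dtype", ["float16"])
@pytest.mark.parametrize("max_tokens", [5])
def test_models_310p(model: str, dtype: str, max_tokens: int) -> None:
    prompts = ["Hello, my name is"]
    # enforce_eager keeps the smoke test quick by skipping graph compilation.
    with VllmRunner(model, dtype=dtype, enforce_eager=True) as vllm_model:
        vllm_model.generate_greedy(prompts, max_tokens)

The Pangu MoE model from the list would go into the 4-card job instead, as in the torchair example earlier in the thread.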
Does this PR introduce any user-facing change?
How was this patch tested?