[P/D] add acc test script of hpu pd disagg #1394

zhenwei-intel · 2025-06-10T06:34:27Z

This PR adds accuracy tests that compare disaggregated (disagg) serving against a non-disaggregated baseline. The changes add configuration files, a shell script that automates the test process, and a Python script that validates the outputs.

Signed-off-by: zhenwei <zhenweiliu@habana.ai>

Copilot

Pull Request Overview

This PR adds a new pipeline to run and compare non-disaggregated (baseline) and disaggregated accuracy tests against a vLLM-based model.

- Introduces a Python script (test_disagg_accuracy.py) to generate outputs, save baseline results, and verify exact matches in disagg mode.
- Adds a Bash script (run_hpu_disagg_accuracy_test.sh) to orchestrate etcd, Mooncake, vLLM servers, and execute both test modes end-to-end.
- Includes mooncake.json for MooncakeStoreConnector configuration.
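
The baseline/disagg comparison described above could be sketched roughly as follows. This is an illustration only, not the code from the PR: the function names (`save_baseline`, `check_against_baseline`) and the JSON layout (a mapping from prompt to generated text) are assumptions.

```python
import json
from pathlib import Path
from typing import Dict, List


def save_baseline(outputs: Dict[str, str], path: Path) -> None:
    """Baseline mode: persist a prompt -> generated-text mapping as JSON."""
    path.write_text(json.dumps(outputs, indent=2, ensure_ascii=False),
                    encoding="utf-8")


def check_against_baseline(outputs: Dict[str, str], path: Path) -> List[str]:
    """Disagg mode: return the prompts whose output differs from the baseline."""
    baseline = json.loads(path.read_text(encoding="utf-8"))
    # Exact string comparison; a prompt missing from the baseline counts as a mismatch.
    return [prompt for prompt, text in outputs.items()
            if baseline.get(prompt) != text]
```

An exact-match check like this is only meaningful when generation is deterministic (for example, greedy decoding); otherwise the two runs may diverge for reasons unrelated to disaggregation.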

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 3 comments.

File	Description
pd_xpyd/test_disagg_accuracy.py	New client script: sends prompts, writes baseline JSON, and asserts disaggregated outputs match
pd_xpyd/run_hpu_disagg_accuracy_test.sh	Automation script: starts services, runs baseline/disagg tests, and cleans up
pd_xpyd/mooncake.json	Configuration for etcd/Mooncake key-value store connector

Comments suppressed due to low confidence (3)

pd_xpyd/test_disagg_accuracy.py:148

In disagg mode you are reading the baseline file, so this should report a read error instead of "Error writing to file".

print(f"Error writing to file: {e}")
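
One possible way to address this comment is to derive the error message from the mode. The sketch below is hypothetical: the helper name `load_or_store`, the `mode` values, and the JSON file format are assumptions, not taken from the PR.

```python
import json
from typing import Optional


def load_or_store(path: str, mode: str,
                  outputs: Optional[dict] = None) -> Optional[dict]:
    """Read the baseline file in disagg mode, write it otherwise."""
    # Pick the wording up front so the except branch reports the right action.
    action = "reading from" if mode == "disagg" else "writing to"
    try:
        if mode == "disagg":
            with open(path, encoding="utf-8") as f:
                return json.load(f)
        with open(path, "w", encoding="utf-8") as f:
            json.dump(outputs, f)
        return outputs
    except (OSError, json.JSONDecodeError) as e:
        print(f"Error {action} file: {e}")
        return None
```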

pd_xpyd/test_disagg_accuracy.py:69

The docstring says "two optional string arguments" but --service_url and --model_name are required. Update the docstring to reflect the actual behavior and all arguments.

    """
    This script demonstrates how to accept two optional string arguments
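
A docstring consistent with required arguments might look like the sketch below. Only `--service_url` and `--model_name` are known from the review comment; the help texts, the description, and the `parse_args` helper name are assumptions, and the real script may accept further arguments.

```python
import argparse


def parse_args(argv=None):
    """Parse the command-line arguments for the accuracy test.

    Required arguments:
      --service_url  URL of the serving endpoint to query.
      --model_name   Name of the model to request.
    """
    parser = argparse.ArgumentParser(
        description="Compare disaggregated outputs against a baseline.")
    # required=True makes argparse exit with an error if the flag is missing.
    parser.add_argument("--service_url", required=True,
                        help="URL of the serving endpoint")
    parser.add_argument("--model_name", required=True,
                        help="Name of the model to request")
    return parser.parse_args(argv)
```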

pd_xpyd/run_hpu_disagg_accuracy_test.sh:23

[nitpick] This variable uses lowercase naming while others are uppercase. Consider renaming to MAX_NUM_BATCHED_TOKENS for consistency with the script’s style.

max_num_batched_tokens=2048


jikunshang · 2025-06-11T14:17:36Z

do you think we can reuse this file? https://github.com/HabanaAI/vllm-fork/blob/habana_main/tests/kv_transfer/test_disagg.py


zhenwei-intel · 2025-06-17T03:03:46Z

> do you think we can reuse this file? https://github.com/HabanaAI/vllm-fork/blob/habana_main/tests/kv_transfer/test_disagg.py

Not very reusable. Here, I run the baseline and PD with one click, then compare whether the outputs are consistent.

Currently, this script is being used as a demo for users or QA.

add acc test script of hpu pd disagg

d5d3119

Signed-off-by: zhenwei <zhenweiliu@habana.ai>

zhenwei-intel requested review from kzawora-intel, madamczyk-intel, michalkuligowski, mgawarkiewicz-intel, vivekgoe, afierka-intel, xuechendi, jikunshang and mswiniarsk as code owners June 10, 2025 06:34

zhenwei-intel requested a review from Copilot June 10, 2025 06:34

Copilot AI reviewed Jun 10, 2025


michalkuligowski previously requested changes Jun 11, 2025


zhenwei-intel marked this pull request as draft June 16, 2025 06:52

fix

27b2bbc

Signed-off-by: zhenwei <zhenweiliu@habana.ai>

zhenwei-intel marked this pull request as ready for review June 17, 2025 03:03

xuechendi approved these changes Jun 19, 2025


zhenwei-intel enabled auto-merge (squash) June 20, 2025 05:34

Merge branch 'habana_main' into lzw/add_test_script

d062b70
