[CB] Add scheduling tests #329
Signed-off-by: Sophie du Couédic <sop@zurich.ibm.com>
👋 Hi! Thank you for contributing to vLLM support on Spyre.
Force-pushed from 0915466 to 59e7270
bot:test
Excited for this!
Let me break this PR into two PRs
Force-pushed from ac373c0 to 6e1f33a
Signed-off-by: Sophie du Couédic <sop@zurich.ibm.com>
Thanks, this is very easy to follow with pencil and paper. It's almost like documentation.
Configuration:
* max_num_seqs: 4
* number of prompts: 4
* 1: len = 49, max tokens = 119, step joining = 0
Suggestion: maybe start counting at 0 here to use the sequence IDs
seqs_max_tokens = [119, 52, 104, 64]
prompts_lengths = [49, 14, 89, 9]
steps_add_reqs = [0, 0, 32, 131]
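To get a feel for how these parameters translate into engine steps, here is a hypothetical back-of-the-envelope simulator. It is not the actual test helper: it ignores prefill-step accounting and other scheduler details, so its count will differ slightly from the real step count discussed below.

```python
def count_steps(seqs_max_tokens, steps_add_reqs, max_num_seqs=4):
    """Toy continuous-batching model: each step, admit any arrived
    requests while the batch has room, then decode one token for
    every running request. Returns the total number of steps."""
    waiting = sorted(zip(steps_add_reqs, seqs_max_tokens))  # (arrival step, token budget)
    running = []  # remaining token budget of each running request
    step = 0
    while waiting or running:
        # admit arrived requests while the batch has a free slot
        while waiting and waiting[0][0] <= step and len(running) < max_num_seqs:
            running.append(waiting.pop(0)[1])
        # one decode step: every running request emits one token
        running = [r - 1 for r in running if r > 1]
        step += 1
    return step

print(count_steps([119, 52, 104, 64], [0, 0, 32, 131]))  # 195 in this toy model
```

The toy model lands near, but not exactly on, the 197 steps mentioned in the review, which is expected given the simplifications.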
I wonder if we can lower the values here - the time for CB testing on CPU is on the rise, and (if possible) shorter max_tokens could speed the tests up while keeping the test logic the same.
I think the eventual goal could be to reduce the total number of steps - the fewer the steps, the faster the test. I don't think we really need 197 steps for this test case?
Something like
seqs_max_tokens = [3, 10, 5]
prompts_lengths = [10, 10, 10]
steps_add_reqs = [0, 0, 5]
where request 0 would finish first, request 1 would be still decoding when request 2 shows up? Or am I missing something obvious?
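The claimed ordering can be sanity-checked with a small sketch under the same pencil-and-paper model used in the test description (one token per running request per step, requests admitted on arrival while the batch has room; this is illustrative only, not the actual test code):

```python
def finish_steps(seqs_max_tokens, steps_add_reqs, max_num_seqs=4):
    """Return {request id: step at which its last token is produced}
    under a toy continuous-batching model (no prefill accounting)."""
    waiting = sorted(
        (arrival, rid, budget)
        for rid, (arrival, budget) in enumerate(zip(steps_add_reqs, seqs_max_tokens))
    )
    running, finished, step = {}, {}, 0
    while waiting or running:
        # admit arrived requests while the batch has a free slot
        while waiting and waiting[0][0] <= step and len(running) < max_num_seqs:
            _, rid, budget = waiting.pop(0)
            running[rid] = budget
        # decode one token for every running request
        for rid in list(running):
            running[rid] -= 1
            if running[rid] == 0:
                finished[rid] = step
                del running[rid]
        step += 1
    return finished

done = finish_steps([3, 10, 5], [0, 0, 5])
# Request 0 finishes before request 2 arrives at step 5,
# and request 1 is still decoding at that point.
print(done)  # {0: 2, 1: 9, 2: 9}
```

So in this simplified model the proposed values do exercise the intended scenario: request 0 drains out early, and request 2 joins a batch in which request 1 is still decoding.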
If this can be made to work with smaller max_tokens values, then perhaps we can open an issue to change all tests within this file to use smaller values to speed things up?
This PR adds a scheduling-steps test where new prompts join during the decode of other sequences, while there is still room left in the batch for new sequences.
Execution was tested on AIU as well (passing).