Check test cases with measurements #2161
With the new design, it will be possible to backfill results into the DB. For example, if you ask on a PR to see results for the Cranelift backend (which is not benchmarked by default), the collector will go back and backfill Cranelift backend data for the parent master commit.
To support that, we need to expand the notion of a benchmark being "done". Right now, when a benchmark begins, we record an
(artifact, benchmark_name)
tuple (called a step) into the DB, and if we ever encounter the same tuple again, we skip the benchmark. That's not ideal: if an error happened and no data was generated, you cannot retry the collection without removing everything for the given artifact from the DB. More importantly, you cannot backfill additional results (e.g. by running only Debug first and then backfilling Opt later), which is also useful for local experiments.

This PR expands the concept of a benchmark being done by actually checking which compile-time test cases are present in the DB. We cheat a bit for better performance: if there is at least one recorded statistic in the DB for a given test case, we consider it done. This essentially ignores missing iterations, but that should be a niche edge case.
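The "done" check described above can be sketched roughly as follows. This is a simplified illustration, not the PR's actual code: the `TestCase` fields, the statistic-count map standing in for a DB query, and both function names are hypothetical.

```rust
use std::collections::HashMap;

/// Hypothetical identifier for a compile-time test case
/// (the real schema has more dimensions, e.g. backend and target).
#[derive(Clone, PartialEq, Eq, Hash, Debug)]
struct TestCase {
    benchmark: String,
    profile: String,  // e.g. "debug" or "opt"
    scenario: String, // e.g. "full" or "incr-full"
}

/// A test case counts as "done" if the DB has at least one recorded
/// statistic for it; missing iterations are deliberately ignored.
/// `recorded` stands in for a query of stored statistics per test case.
fn is_done(recorded: &HashMap<TestCase, u32>, tc: &TestCase) -> bool {
    recorded.get(tc).map_or(false, |&count| count > 0)
}

/// Given the test cases requested for an artifact, return only those
/// that still need to run -- these are the ones to backfill.
fn missing_test_cases<'a>(
    recorded: &HashMap<TestCase, u32>,
    requested: &'a [TestCase],
) -> Vec<&'a TestCase> {
    requested.iter().filter(|tc| !is_done(recorded, tc)).collect()
}
```

With this check, running only Debug first and later requesting Opt results in `missing_test_cases` returning just the Opt cases, so only those are benchmarked and backfilled.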
Even though this logic is mostly useful for the new scheme, which is not implemented yet, I decided to also implement it for the current benchmarking logic, because it's useful for local experiments.
Best reviewed commit by commit.