Add flaky pytest infrastructure and weekend runners #1799

paul0403 · 2025-06-10T20:27:40Z

Context:
We wish to reduce flaky tests in CI.
There is a Python package, flaky (already in our requirements.txt), that reruns failed tests to hunt for flakiness.
Let's actually use it.

Description of the Change:

Add flaky infrastructure. A pytest run that hunts for flaky tests can be launched with make pytest ENABLE_FLAKY=ON.
Four tests that change global state cannot be run more than once, so we force them to run only once. None of them is stochastic in nature.
Add a script to launch flaky-hunting runs on weekends.

Benefits:
If any new tests are flaky, this weekend runner will catch them!

[sc-87028]

github-actions · 2025-06-10T20:27:55Z

Hello. You may have forgotten to update the changelog!
Please edit doc/releases/changelog-dev.md on your branch with:

A one-to-two sentence description of the change. You may include a small working example for new features.
A link back to this PR.
Your name (or GitHub username) in the contributors section.

.github/workflows/check-weekly-flaky.yaml

dime10

Great idea 💯

The only thing I would say is that we should fix those global state mutations instead of forcing the tests to only run once :)

.github/workflows/check-weekly-flaky.yaml

(suggestion by github action bot)

paul0403 · 2025-06-11T17:38:45Z

Since this action installs from TestPyPI, this PR's CI will fail any tests that depend on new features not yet published there. For example, a build uploaded to TestPyPI at midnight on Tuesday will fail tests added on Wednesday.

This is fine. I only enable the script on PRs to check the script itself.

frontend/test/pytest/device/test_decomposition.py

codecov · 2025-06-11T19:47:53Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 96.61%. Comparing base (2c0917c) to head (86755fb).
Report is 1 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #1799   +/-   ##
=======================================
  Coverage   96.61%   96.61%           
=======================================
  Files          82       82           
  Lines        9265     9265           
  Branches      875      875           
=======================================
  Hits         8951     8951           
  Misses        255      255           
  Partials       59       59

paul0403 · 2025-06-11T20:00:48Z

frontend/test/pytest/test_autograph.py

+def reset_Failing():
+    save = Failing.triggered.copy()
+    yield
+    Failing.triggered = save


This one is more straightforward: the Failing class uses a class variable to keep track of labels it has already seen, and this record is not reset between different tests or between reruns of the same test.

I just added a fixture to reset it every time.
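
A standalone sketch of the save-and-restore pattern described above, runnable without pytest. The Failing class here is a hypothetical stand-in (the real utility in test_autograph.py may look different), and contextlib.contextmanager is used only so the generator can be driven outside a test run; in the test suite the same generator body would presumably be registered as a pytest fixture.

```python
from contextlib import contextmanager
from typing import Iterator


class Failing:
    """Hypothetical stand-in for the test utility: remembers labels it has seen."""

    # Class variable, shared by every instance and by every test that uses it.
    triggered: set = set()

    def __init__(self, label: str) -> None:
        self.label = label

    def fire(self) -> bool:
        """Return True only the first time a given label is seen."""
        if self.label in Failing.triggered:
            return False
        Failing.triggered.add(self.label)
        return True


@contextmanager
def reset_failing() -> Iterator[None]:
    """Snapshot the class-level record, then restore it afterwards."""
    save = Failing.triggered.copy()
    try:
        yield
    finally:
        Failing.triggered = save


# With the reset, three "reruns" of the same check all see a fresh record.
with_reset = []
for _ in range(3):
    with reset_failing():
        with_reset.append(Failing("a").fire())

# Without the reset, the label recorded by the first run leaks into later runs.
Failing.triggered = set()
without_reset = [Failing("a").fire() for _ in range(3)]

print(with_reset, without_reset)
```

Restoring a copy rather than clearing the set keeps any labels that were recorded before the test started, so the fixture is neutral with respect to whatever ran earlier.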

paul0403 added 3 commits June 10, 2025 15:52

Add flaky test infra in make pytest

bb4634e

5 runs

931db6a

add weekly flaky test runner script

d61ea62

paul0403 requested a review from a team June 10, 2025 20:27

github-advanced-security bot found potential problems Jun 10, 2025

.github/workflows/check-weekly-flaky.yaml (Fixed)

paul0403 added 2 commits June 10, 2025 17:10

try skipping cuda

cc61117

don't ask when pip uninstall

298a9ab

dime10 requested changes Jun 10, 2025

testpypi catalyst follows dep versions already

41ca911

paul0403 commented Jun 10, 2025

.github/workflows/check-weekly-flaky.yaml (Outdated)

paul0403 added 3 commits June 11, 2025 09:41

Merge remote-tracking branch 'origin/main' into paul0403/add_flaky_runs

76eab9d

Merge remote-tracking branch 'origin/main' into paul0403/add_flaky_runs

ec82572

set read-only permission on job

fdcd8f3

(suggestion by github action bot)

paul0403 added 3 commits June 11, 2025 15:24

popping capabilities should be per custom device class, NOT per instance!

b61f2db

remove unused flaky import

a981a4a

Merge remote-tracking branch 'origin/main' into paul0403/add_flaky_runs

91f9c47

paul0403 commented Jun 11, 2025

frontend/test/pytest/device/test_decomposition.py

paul0403 added 2 commits June 11, 2025 15:52

add reset fixture to autograph Failing test util class

38b8753

add failure notif

35feb0f

paul0403 commented Jun 11, 2025

paul0403 added 3 commits June 11, 2025 16:15

codefactor

4d78c8e

test failure notif

a804c32

remove temporary PR testings

29e82b8

paul0403 requested a review from dime10 June 11, 2025 20:51

paul0403 added 3 commits June 12, 2025 09:40

Merge remote-tracking branch 'origin/main' into paul0403/add_flaky_runs

d2f87e1

remove unnecessary line

86755fb

add docstring for the mysterious Failing class in autograph test

05421ef
