Make test_averaging_cancel more robust #650

mryab · 2025-04-19T11:16:26Z

Currently, test_averaging_cancel is one of the few flaky tests in our codebase: see e.g. https://github.com/learning-at-home/hivemind/actions/runs/14024261668/job/40777059821, where it failed 3 times out of 5.

Based on my investigation and discussion with @justheuristic, the problem seems to be that the test is very loosely defined: we start 4 averaging peers, cancel 2, and expect the resulting group size to be 2 nonetheless. However, because target_group_size=None by default, the groups can end up with sizes of 3 and 1, which means that the len(control.result()) == 2 assertion will occasionally fail.

To fix this issue, the PR introduces a separate test case with a fixed target_group_size. To make sure that peers don't average before cancellation, it also uses the averaging triggers to arrange the groups without starting the averaging itself.
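
To make the arithmetic behind the flakiness concrete, here is a toy sketch in plain Python. It is not hivemind's matchmaking code and it ignores cancellation and timing entirely: group_size_splits and assertion_holds are hypothetical helpers that only enumerate the ways 4 peers could be split into groups, and check which splits would satisfy a "every group has exactly 2 members" condition like the one in the test.

```python
from typing import Iterator, List, Optional


def group_size_splits(n_peers: int, max_size: Optional[int] = None) -> Iterator[List[int]]:
    """Yield every split of n_peers into groups, as non-increasing lists of group sizes.

    max_size is a toy stand-in for a cap on group size; None means "no cap".
    """
    cap = n_peers if max_size is None else min(max_size, n_peers)

    def split(remaining: int, largest: int) -> Iterator[List[int]]:
        if remaining == 0:
            yield []
            return
        # Pick the next group size, never larger than the previous one.
        for size in range(min(remaining, largest), 0, -1):
            for rest in split(remaining - size, size):
                yield [size] + rest

    yield from split(n_peers, cap)


def assertion_holds(sizes: List[int], expected: int = 2) -> bool:
    """Toy version of the test's check: every group has exactly `expected` members."""
    return all(size == expected for size in sizes)


uncapped = list(group_size_splits(4))
capped = list(group_size_splits(4, max_size=2))

print(uncapped)  # [[4], [3, 1], [2, 2], [2, 1, 1], [1, 1, 1, 1]]
print(capped)    # [[2, 2], [2, 1, 1], [1, 1, 1, 1]]
print(assertion_holds([3, 1]))  # False
print(assertion_holds([2, 2]))  # True
```

Without a cap, the 3 + 1 split is among the possible outcomes and fails the check. Capping the size at 2 rules out any group of 3, but the toy model says nothing about which of the remaining splits actually occurs; in the real test, that is what the averaging triggers are meant to control.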

codecov · 2025-04-19T11:30:16Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 85.45%. Comparing base (d20e810) to head (ba2e073).
Report is 23 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #650      +/-   ##
==========================================
+ Coverage   85.39%   85.45%   +0.05%     
==========================================
  Files          81       96      +15     
  Lines        8006     8575     +569     
==========================================
+ Hits         6837     7328     +491     
- Misses       1169     1247      +78

see 35 files with indirect coverage changes

justheuristic

Thanks! / LGTM

mryab added 2 commits April 19, 2025 13:06

Make test_averaging_cancel more robust

015e054

Remove min_group_size

ba2e073

mryab requested a review from justheuristic April 19, 2025 14:12

justheuristic approved these changes Apr 19, 2025

mryab merged commit 5353328 into master Apr 19, 2025
28 of 31 checks passed

mryab deleted the fix-test-averaging-cancel branch April 19, 2025 14:44

mryab added a commit that referenced this pull request Apr 20, 2025

Make test_averaging_cancel more robust (#650)

bdfed32

* Make test_averaging_cancel more robust (cherry picked from commit 5353328)
