Fix liftover caching bug in tests #878

bpblanken · 2024-08-20T04:42:58Z

There's a sneaky issue we're running into in the tests with hail's liftover being cached between unit tests. We're also not correctly using the locally stored leftovers but we should be!

hanars · 2024-08-20T20:49:58Z

v03_pipeline/lib/tasks/base/base_hail_table.py

+        # This runs "before" as task to account for situations where
+        # the Hail write fails and we do not have the chance to
+        # run this method.
+        remove_liftover()


I understand why we want this for testing, but in production why wouldn't we want to use the cached liftover form a previous run?

In some sense yes. But our current setup runs almost task in a fresh dataproc job so we're getting an empty and clean Hail for the most part anyways.

I could definitely be convinced to move this into the tearDown of the tests since that's more logically correct and more performant (but likely not noticeably), but I think it's easier to reason about this in the application code as part of the initialization process for a task. I don't like there being sneaky state inside of a task context without it being clear as to why.

I don't like there being sneaky state inside of a task context without it being clear as to why.

Maybe update the comment to reflect this to make it clearer why this is being cleaned up at all (the current comment is just explaining why it runs in the before but not why its being run at all)

…s into benb/fix_caching_bug_in_tests

…tute/seqr-loading-pipelines into benb/fix_caching_bug_in_tests

bpblanken added 7 commits August 19, 2024 23:57

env vars

a9cdd72

remove liftover from hail

95d12ef

ruff

d7e9e5e

spelling

a0d5454

move remove_liftover

495db53

set value on mock

0744ab1

ruff

be59b0a

bpblanken changed the title ~~Benb/fix caching bug in tests~~ Fix leftover caching bug in tests Aug 20, 2024

move from before tasks to after tasks

3ba8e8b

bpblanken changed the title ~~Fix leftover caching bug in tests~~ Fix liftover caching bug in tests Aug 20, 2024

move it back

897b998

bpblanken marked this pull request as ready for review August 20, 2024 20:23

bpblanken requested a review from a team as a code owner August 20, 2024 20:23

hanars reviewed Aug 20, 2024

View reviewed changes

bpblanken and others added 5 commits August 20, 2024 17:00

Update base_hail_table.py

a2cd001

Update base_hail_table.py

f5030fc

Merge branch 'dev' of github.com:broadinstitute/seqr-loading-pipeline…

5006834

…s into benb/fix_caching_bug_in_tests

Merge branch 'benb/fix_caching_bug_in_tests' of github.com:broadinsti…

6e11633

…tute/seqr-loading-pipelines into benb/fix_caching_bug_in_tests

ws

ee0a792

bpblanken merged commit 1bfe3b3 into dev Aug 20, 2024
3 checks passed

bpblanken deleted the benb/fix_caching_bug_in_tests branch August 20, 2024 22:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix liftover caching bug in tests #878

Fix liftover caching bug in tests #878

Uh oh!

bpblanken commented Aug 20, 2024 •

edited

Loading

Uh oh!

hanars Aug 20, 2024

Uh oh!

bpblanken Aug 20, 2024

Uh oh!

hanars Aug 20, 2024

Uh oh!

Uh oh!

Uh oh!

Fix liftover caching bug in tests #878

Fix liftover caching bug in tests #878

Uh oh!

Conversation

bpblanken commented Aug 20, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hanars Aug 20, 2024

Choose a reason for hiding this comment

Uh oh!

bpblanken Aug 20, 2024

Choose a reason for hiding this comment

Uh oh!

hanars Aug 20, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

bpblanken commented Aug 20, 2024 •

edited

Loading