add sample qc task and filter_flags #1034

jklugherz · 2025-02-06T21:48:25Z

Adds a sample_qc Luigi task with a single metric, filter_flags, that outputs a JSON file. The task is now a dependency of the write callset task, and the sample QC JSON is added to the metadata JSON.

v03_pipeline/lib/paths.py

v03_pipeline/lib/tasks/write_remapped_and_subsetted_callset.py

v03_pipeline/lib/methods/sample_qc.py

jklugherz · 2025-02-14T22:11:47Z

v03_pipeline/lib/tasks/write_sample_qc_json_test.py

+
+
+class WriteSampleQCJsonTaskTest(MockedDatarootTestCase):
+    @patch('v03_pipeline.lib.tasks.write_sample_qc_json.WriteTDRMetricsFilesTask')


I was not able to track down the BigQuery function mock contamination that happened when this test ran alongside WriteSexCheckTableTaskTest, so I just mocked the entire WriteTDRMetricsFilesTask 🤷
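A minimal sketch of the approach described in this comment: replace the whole upstream task class at the point where the code under test looks it up, so none of its BigQuery calls run. The `tasks` namespace, the stub class body, and `run_sample_qc` are hypothetical stand-ins for illustration, not the pipeline's actual code; the real test patches the name inside `v03_pipeline.lib.tasks.write_sample_qc_json`.

```python
import types
from unittest.mock import patch

# Hypothetical stand-in for the module that holds the upstream task.
tasks = types.SimpleNamespace()


class WriteTDRMetricsFilesTask:
    """Stub for the upstream task; the real one would query BigQuery."""

    def run(self):
        raise RuntimeError('would query BigQuery')


tasks.WriteTDRMetricsFilesTask = WriteTDRMetricsFilesTask


def run_sample_qc():
    # Hypothetical code under test: it looks the upstream task up on `tasks`.
    upstream = tasks.WriteTDRMetricsFilesTask()
    upstream.run()
    return 'sample_qc_done'


# Patch the name where it is looked up, so the real class is never touched.
with patch.object(tasks, 'WriteTDRMetricsFilesTask') as mock_task:
    outcome = run_sample_qc()
    upstream_was_constructed = mock_task.called
    upstream_run_calls = mock_task.return_value.run.call_count

# Outside the `with` block the original class is restored.
restored = tasks.WriteTDRMetricsFilesTask is WriteTDRMetricsFilesTask
```

The trade-off in this sketch is that the upstream task is not exercised at all, which is acceptable only if it has its own test coverage.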

v03_pipeline/lib/methods/sample_qc.py

bpblanken · 2025-02-19T16:18:25Z

v03_pipeline/lib/tasks/write_sample_qc_json.py

+        sample_qc_dict = defaultdict(dict)
+        for row in ht.flatten().collect():
+            r = dict(row)
+            sample_id = r.pop('s')


might be cleaner as

for field, value in r.items(): sample_qc_dict[r.pop('s')][field] = value

I don't think this is possible: it raises RuntimeError: dictionary changed size during iteration
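A small sketch of why the one-liner fails and what the two-step version looks like. Plain dicts stand in for the rows returned by `ht.flatten().collect()`, and the function names and sample data are hypothetical; the loop body after `r.pop('s')` is an assumption, since the diff hunk above is cut off at that line.

```python
from collections import defaultdict


def one_liner_error(row):
    """Run the suggested one-liner and return the error message it raises."""
    r = dict(row)
    grouped = defaultdict(dict)
    try:
        for field, value in r.items():
            # Popping 's' shrinks the dict while its items are being iterated.
            grouped[r.pop('s')][field] = value
    except RuntimeError as e:
        return str(e)
    return None


def rows_to_sample_qc_dict(rows):
    """Group per-sample fields by sample id, popping 's' before iterating."""
    sample_qc_dict = defaultdict(dict)
    for row in rows:
        r = dict(row)
        sample_id = r.pop('s')  # remove the key first, then iterate safely
        for field, value in r.items():
            sample_qc_dict[sample_id][field] = value
    return dict(sample_qc_dict)


rows = [
    {'s': 'sample_1', 'filter_flags': ['contamination']},
    {'s': 'sample_2', 'filter_flags': []},
]
error_message = one_liner_error(rows[0])
result = rows_to_sample_qc_dict(rows)
```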

v03_pipeline/lib/methods/sample_qc.py

bpblanken · 2025-02-25T04:35:14Z

v03_pipeline/lib/misc/io.py

@@ -244,6 +244,17 @@ def import_imputed_sex(imputed_sex_path: str) -> hl.Table:
    return ht.key_by(ht.s)


+def import_tdr_qc_metrics(file_path: str) -> hl.Table:
+    ht = hl.import_table(file_path)


I think there’s a way to define the types for non-strings at import time, we should try that!

bpblanken · 2025-02-25T04:36:51Z

v03_pipeline/lib/paths.py

+            dataset_type,
+        ),
+        'sample_qc',
+        f'{hashlib.sha256(callset_path.encode("utf8")).hexdigest()}.json',


There’s the new “callset_path_hash” function that snuck in after this was started. We can use it here!
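A hedged sketch of what the suggested change could look like. The body of `callset_path_hash` here is an assumption that simply mirrors the inline expression in the diff above; the real helper's signature and location are not shown in this thread. `sample_qc_json_name` is a hypothetical name used only for illustration.

```python
import hashlib


def callset_path_hash(callset_path: str) -> str:
    # Assumed body: the same SHA-256 hex digest the diff computes inline.
    return hashlib.sha256(callset_path.encode('utf8')).hexdigest()


def sample_qc_json_name(callset_path: str) -> str:
    # Hypothetical helper: builds the file name from the shared hash function
    # instead of repeating the hashlib call at each call site.
    return f'{callset_path_hash(callset_path)}.json'
```

Keeping the hashing in one function means every path derived from a callset uses the same digest, so the files stay in sync if the scheme ever changes.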

bpblanken

lgtm! thank you!

jklugherz added 2 commits February 6, 2025 16:45

add filtered_callrate sample qc metric and luigi task

b5013d7

delete extraneous test file

61a7d20

jklugherz requested a review from a team as a code owner February 6, 2025 21:48

bpblanken reviewed Feb 6, 2025

v03_pipeline/lib/paths.py (outdated, resolved)

bpblanken reviewed Feb 6, 2025

v03_pipeline/lib/tasks/write_remapped_and_subsetted_callset.py (outdated, resolved)

jklugherz added 8 commits February 7, 2025 15:49

add to metadata

9228694

feature flag in write metadata

1c1afc8

expect tdr metrics in unit tests

0a474dd

mock env

0ad49d4

mock_ff

52997df

add sample qc filter flags

3ce37fa

chimera

7c3d76f

fix the TEST

68646b5

jklugherz requested a review from bpblanken February 11, 2025 16:18

jklugherz added 5 commits February 11, 2025 11:19

Merge remote-tracking branch 'origin/sample-qc-filtered-callrate' into sample-qc-filter_flags

e51af50

update tsv test file

9b6baf7

types!

042a67e

types

62294fc

remove filtered_callrate, add sample_type

52d28f7

jklugherz changed the title from "add sample qc task and first metric - filtered_callrate" to "add sample qc task and filter_flags" Feb 12, 2025

jklugherz added 2 commits February 13, 2025 10:35

?

170af80

Merge remote-tracking branch 'origin/dev' into sample-qc-filtered-callrate

7d14822

bpblanken reviewed Feb 13, 2025

v03_pipeline/lib/methods/sample_qc.py (outdated, resolved)

jklugherz added 2 commits February 13, 2025 12:54

json

23aae3c

delete

c64bf5e

jklugherz requested a review from bpblanken February 13, 2025 17:56

jklugherz added 4 commits February 13, 2025 13:29

try assertcountequal

aa82644

array

7509850

formatting

d8e6ce2

Merge remote-tracking branch 'origin/dev' into sample-qc-filtered-callrate

ac128f9

jklugherz added 4 commits February 13, 2025 15:08

oops

f5c43e9

mock something else

bbf4ad6

r

5350b5e

Merge remote-tracking branch 'origin/dev' into sample-qc-filtered-callrate

96dbc1e

jklugherz commented Feb 14, 2025

jklugherz requested a review from matren395 February 14, 2025 22:17

deleted too many files

c8526d6

bpblanken reviewed Feb 19, 2025

jklugherz commented Feb 19, 2025

v03_pipeline/lib/methods/sample_qc.py (outdated, resolved)

jklugherz added 2 commits February 19, 2025 13:43

comments

2770d0a

do not drop columns

5580bfc

jklugherz requested a review from bpblanken February 20, 2025 16:41

bpblanken reviewed Feb 25, 2025

review

0aac554

bpblanken approved these changes Mar 4, 2025

jklugherz merged commit 48537ac into dev Mar 17, 2025
3 checks passed
