Commit b3708d4

No duplicates on GRCh38
1 parent baeffcb commit b3708d4

1 file changed, +2 -1 lines changed

v03_pipeline/lib/reference_datasets/topmed.py

Lines changed: 2 additions & 1 deletion
@@ -15,7 +15,8 @@
 
 def get_ht(path: str, reference_genome: ReferenceGenome) -> hl.Table:
     ht = vcf_to_ht(path, reference_genome)
-    ht = ht.filter(ht.locus.position == hl.int(ht.info.OriginalStart))
+    if reference_genome == ReferenceGenome.GRCh37:
+        ht = ht.filter(ht.locus.position == hl.int(ht.info.OriginalStart))
     return ht.select(
         **{k: parse_nested_field(ht, v) for k, v in SELECT.items()},
     )
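
The effect of the change can be sketched without a Hail session. The plain-Python analogue below is only an illustration: the `ReferenceGenome` enum and the `filter_rows` helper are hypothetical stand-ins, not the pipeline's code, and the sample rows are made up. Only the branching mirrors the diff: the `OriginalStart` filter runs for GRCh37 and is skipped for GRCh38.

```python
from enum import Enum


class ReferenceGenome(Enum):
    # Hypothetical stand-in for the pipeline's ReferenceGenome enum.
    GRCh37 = "GRCh37"
    GRCh38 = "GRCh38"


def filter_rows(rows, reference_genome):
    """Plain-Python analogue of the conditional filter in get_ht."""
    if reference_genome == ReferenceGenome.GRCh37:
        # Keep only rows whose position matches info.OriginalStart.
        return [r for r in rows if r["position"] == int(r["OriginalStart"])]
    # On GRCh38 the rows pass through unfiltered.
    return list(rows)


# Made-up rows: the second one has a position that differs from OriginalStart.
rows = [
    {"position": 100, "OriginalStart": "100"},
    {"position": 100, "OriginalStart": "250"},
]
kept_37 = filter_rows(rows, ReferenceGenome.GRCh37)
kept_38 = filter_rows(rows, ReferenceGenome.GRCh38)
```

With these rows, `kept_37` drops the mismatching row while `kept_38` keeps both, which is the behaviour the commit introduces for GRCh38.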
