-
Notifications
You must be signed in to change notification settings - Fork 20
Parameter refactoring + cleanup. #809
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
56 commits
Select commit
Hold shift + click to select a range
4e05a0d
Remove concept of private crdqs
bpblanken 65da04f
lint
bpblanken f90baef
fix logic
bpblanken 7c8cbb2
Move SampleType out of BaseHailTableTask
bpblanken 4a2f978
cleanup
bpblanken fb737f8
fix
bpblanken 2e2676b
missed a few!
bpblanken 89754a4
shitshow
bpblanken 3879c68
First one?
bpblanken 2e1620f
flip order here
bpblanken f93a721
Merge branch 'benb/split_out_sample_type' of github.com:broadinstitut…
bpblanken df3e259
flip order
bpblanken d730d8c
flip order
bpblanken 96342b7
Merge branch 'benb/split_out_sample_type' of github.com:broadinstitut…
bpblanken 98cffbb
try sharing these params?
bpblanken 93cadea
that failed :/
bpblanken cb8b974
more scaffolding
bpblanken 92868de
ruff
bpblanken ec8b1b2
Another one
bpblanken 68bc804
more hacking
bpblanken 0c22f46
Fix write metadata
bpblanken 4dec7b8
typo
bpblanken 379ae36
missing param
bpblanken e3cae7a
typo
bpblanken 6e2d31e
making progress
bpblanken 3d317cf
format
bpblanken eff732a
remove pairing
bpblanken f39db46
another
bpblanken e81a3e0
Merge branch 'dev' of github.com:broadinstitute/seqr-loading-pipeline…
bpblanken b8e7802
Merge branch 'benb/split_out_sample_type' of github.com:broadinstitut…
bpblanken 4cde8f4
missing test liftover
bpblanken 56c341e
merge
bpblanken c26fe1d
Merge branch 'dev' of github.com:broadinstitute/seqr-loading-pipeline…
bpblanken 391073f
ruff
bpblanken 9657823
ruff
bpblanken 907eee4
use predetermined filters/imputed_sex paths
bpblanken f1a8e63
ruff
bpblanken 0270a88
formalize
bpblanken e33306a
lint
bpblanken a9c485b
lint
bpblanken ac83cd0
Fix arg
bpblanken 27c6bd8
Change parameters again
bpblanken 1840e1e
Update a bunch of args
bpblanken a686ce8
missed a few
bpblanken 47c581e
fix liftover
bpblanken 41a23a5
ruff
bpblanken e5b446d
reformat filters annotation
bpblanken 3cd656b
add env vars too!
bpblanken bab5b5e
ruff
bpblanken c6816af
ruff
bpblanken 56c7125
Fix logic
bpblanken 8bfb13b
add env mock
bpblanken 7056e34
Update environment.py
bpblanken 9fddc87
update zip
bpblanken 20f3a20
Merge branch 'benb/single_callset_path' of github.com:broadinstitute/…
bpblanken 8cf46a0
ruff
bpblanken File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,36 @@ | ||
import luigi | ||
|
||
from v03_pipeline.lib.model import SampleType | ||
|
||
|
||
class BaseLoadingRunParams(luigi.Task): | ||
# NB: | ||
# These params are "inherited" with the special | ||
# luigi.util.inherits function, copying params | ||
# but nothing else. | ||
sample_type = luigi.EnumParameter(enum=SampleType) | ||
callset_path = luigi.Parameter() | ||
ignore_missing_samples_when_remapping = luigi.BoolParameter( | ||
default=False, | ||
parsing=luigi.BoolParameter.EXPLICIT_PARSING, | ||
) | ||
force = luigi.BoolParameter( | ||
default=False, | ||
parsing=luigi.BoolParameter.EXPLICIT_PARSING, | ||
) | ||
skip_check_sex_and_relatedness = luigi.BoolParameter( | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Some context on this:
|
||
default=False, | ||
parsing=luigi.BoolParameter.EXPLICIT_PARSING, | ||
) | ||
skip_expect_filters = luigi.BoolParameter( | ||
default=False, | ||
parsing=luigi.BoolParameter.EXPLICIT_PARSING, | ||
) | ||
skip_validation = luigi.BoolParameter( | ||
default=False, | ||
parsing=luigi.BoolParameter.EXPLICIT_PARSING, | ||
) | ||
is_new_gcnv_joint_call = luigi.BoolParameter( | ||
default=False, | ||
description='Is this a fully joint-called callset.', | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. do we also need |
||
) |
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
what are 'part_one_outputs' and 'part_two_outputs'?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@matren395 can give some context here! I think it's just that internal exome callsets have always been delivered in two different folders, and we run an extra step to join the
filters
part onto the first part!There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That's what I'd believe this is too, yes!
WES callsets are delivered in two parts usually, with two folders for Part_One_Outputs and Part_Two_Outputs. Part_One_Outputs are the "normal" sharded VCFs, one set of sharded VCFs per chromosome, complete with tons of annotations and information. Part_Two_Outputs is I believe just one VCF per chromosome, containing the sites to filter the Part_One VCFs to .