|
19 | 19 | - [GvsExtractAvroFilesForHail](https://dockstore.org/my-workflows/github.com/broadinstitute/gatk/GvsExtractAvroFilesForHail) workflow
|
20 | 20 | - [GvsPrepareRangesCallset](https://dockstore.org/my-workflows/github.com/broadinstitute/gatk/GvsPrepareRangesCallset) workflow
|
21 | 21 | - [GvsExtractCallset](https://dockstore.org/my-workflows/github.com/broadinstitute/gatk/GvsExtractCallset) workflow
|
| 22 | + - [GvsExtractCallsetPgenMerged](https://dockstore.org/my-workflows/github.com/broadinstitute/gatk/GvsExtractCallsetPgenMerged) workflow |
22 | 23 | - [GvsCallsetStatistics](https://dockstore.org/workflows/github.com/broadinstitute/gatk/GvsCallsetStatistics) workflow
|
23 | 24 | - [GvsCalculatePrecisionAndSensitivity](https://dockstore.org/workflows/github.com/broadinstitute/gatk/GvsCalculatePrecisionAndSensitivity) workflow
|
24 | 25 | - [GvsCallsetCost](https://dockstore.org/workflows/github.com/broadinstitute/gatk/GvsCallsetCost) workflow
|
|
92 | 93 | - If you are debugging a Hail-related issue, you may want to set `leave_hail_cluster_running_at_end` to `true` and refer to [the suggestions for debugging issues with Hail](HAIL_DEBUGGING.md).
|
93 | 94 |
|
94 | 95 | 1. `GvsCallsetStatistics` workflow
|
95 |
| - - You will need to run `GvsPrepareRangesCallset` workflow first, if it has not been run already |
| 96 | + - You will need to run `GvsPrepareRangesCallset` workflow for callset statistics first, if it has not been run already. |
96 | 97 | - This workflow transforms the data in the vet tables into a schema optimized for callset stats creation and for calculating sensitivity and precision.
|
97 | 98 | - The `only_output_vet_tables` input should be set to `true` (the default value is `false`).
|
98 | 99 | - The `enable_extract_table_ttl` input should be set to `true` (the default value is `false`), which will add a TTL of two weeks to the tables it creates.
|
|
102 | 103 | - This workflow needs to be run with the `extract_table_prefix` input from `GvsPrepareRangesCallset` step.
|
103 | 104 | - This workflow needs to be run with the `filter_set_name` input from `GvsCreateFilterSet` step.
|
104 | 105 | - This workflow does not use the Terra Data Entity Model to run, so be sure to select the `Run workflow with inputs defined by file paths` workflow submission option.
|
| 106 | +1. `GvsExtractCallset` / `GvsExtractCallsetPgenMerged` workflow |
| 107 | + - You will need to run the `GvsPrepareRangesCallset` workflow for each "[Region](https://support.researchallofus.org/hc/en-us/articles/14929793660948-Smaller-Callsets-for-Analyzing-Short-Read-WGS-SNP-Indel-Data-with-Hail-MT-VCF-and-PLINK)" (interval list) for which a PGEN or VCF deliverable is required for the callset. |
| 108 | + - This workflow transforms the data in the vet, ref_ranges, and samples tables into a schema optimized for extract. |
| 109 | + - The `enable_extract_table_ttl` input should be set to `true` (the default value is `false`), which will add a TTL of two weeks to the tables it creates. |
| 110 | + - `extract_table_prefix` should be set to a name that is unique to the given Region / interval list. See the [naming conventions doc](https://docs.google.com/document/d/1pNtuv7uDoiOFPbwe4zx5sAGH7MyxwKqXkyrpNmBxeow) for guidance on what to use. |
| 111 | + - Specify the `interval_list` appropriate for the PGEN / VCF extraction run you are performing. |
| 112 | + - This workflow does not use the Terra Data Entity Model to run, so be sure to select the `Run workflow with inputs defined by file paths` workflow submission option. |
| 113 | + - Specify the same `call_set_identifier`, `dataset_name`, `project_id`, `extract_table_prefix`, and `interval_list` that were used in the `GvsPrepareRangesCallset` run documented above. |
| 114 | + - Specify the `interval_weights_bed` appropriate for the PGEN / VCF extraction run you are performing. `gs://gvs_quickstart_storage/weights/gvs_full_vet_weights_1kb_padded_orig.bed` is the interval weights BED used for Quickstart. |
| 115 | + - These workflows do not use the Terra Data Entity Model to run, so be sure to select the `Run workflow with inputs defined by file paths` workflow submission option. |
105 | 116 | 1. `GvsCalculatePrecisionAndSensitivity` workflow
|
106 | 117 | - Please see the detailed instructions for running the Precision and Sensitivity workflow [here](../../tieout/AoU_PRECISION_SENSITIVITY.md).
|
107 | 118 | 1. `GvsCallsetCost` workflow
|
|
0 commit comments