rsv-b-qc

A companion pipeline to BCCDC-PHL/rsv-b-artic-nf for quality-control analysis. This pipeline is a modified version of BCCDC-PHL/mpxv-artic-nf, which was based directly on jts/ncov-tools, with some adjustments made for monkeypox virus analysis.

flowchart TD
run_dir[run_dir]
primer_bed[primer.bed]
primer_pairs[primer_pairs.tsv]
run_dir -- consensus --> identify_complete_genomes(identify_complete_genomes)
identify_complete_genomes --> prepare_multi_fasta(prepare_multi_fasta)
nextclade_dataset(nextclade_dataset)
nextclade_dataset -- dataset --> nextclade(nextclade)
prepare_multi_fasta --> nextclade
nextclade -- qc --> nextclade_qc.tsv
prepare_multi_fasta --> augur_align(augur_align)
nextclade_dataset -- ref --> augur_align
augur_align --> augur_tree(augur_tree)
nextclade_dataset -- ref --> augur_tree
augur_align --> make_alleles(make_alleles)
nextclade_dataset -- ref --> make_alleles
augur_tree --> plot_tree_snps(plot_tree_snps)
make_alleles --> plot_tree_snps
nextclade -- qc --> plot_tree_snps
plot_tree_snps --> tree_snps.pdf
run_dir -- variants --> snpeff(snpeff)
nextclade_dataset -- ref --> snpeff
snpeff --> make_aa_table(make_aa_table)
primer_bed --> primer_bed_to_amplicon_bed(primer_bed_to_amplicon_bed)
primer_pairs --> primer_bed_to_amplicon_bed
run_dir -- alignment --> calc_amplicon_depth(calc_amplicon_depth)
primer_bed_to_amplicon_bed --> calc_amplicon_depth
run_dir -- variants --> create_primer_snp_bed(create_primer_snp_bed)
primer_bed --> create_primer_snp_bed
nextclade_dataset -- ref --> make_genome_bed(make_genome_bed)
run_dir -- alignments --> calc_per_base_depth(calc_per_base_depth)
make_genome_bed --> calc_per_base_depth
make_sample_qc_summary(make_sample_qc_summary)
run_dir -- consensus --> make_sample_qc_summary
run_dir -- variants --> make_sample_qc_summary
calc_per_base_depth --> make_sample_qc_summary
make_alleles --> make_sample_qc_summary
make_sample_qc_summary --> write_qc_summary(write_qc_summary)
write_qc_summary --> summary_qc.tsv

Usage

This pipeline makes some assumptions about the directory structure of the input dataset. We assume that there is a directory below the directory passed via the --run_dir flag, named like: rsv-b-artic-nf-vX.Y-output, which contains output from the BCCDC-PHL/rsv-b-artic-nf pipeline.

nextflow run BCCDC-PHL/rsv-b-qc \
  -profile conda \
  --cache ~/.conda/envs \
  --run_dir </path/to/run_dir> \
  --bed </path/to/rsv-a.scheme.bed> \
  --primer_pairs_tsv </path/to/primer_pairs.tsv> \
  --outdir <output_dir>

Parameters

Name	Default	Description
`minimum_genome_completeness`	`85.0`	Genome completeness threshold below which samples will be excluded from tree/SNPs plot.
`partial_genome_threshold`	`85.0`	Genome completeness threshold below which samples will be tagged as `PARITAL_GENOME`
`incomplete_genome_threshold`	`50.0`	Genome completeness threshold below which samples will be tagged as `INCOMPLETE_GENOME`
`excess_ambiguity_threshold`	`5`	Number of ambiguous bases in consensus above which samples will be tagged as `EXCESS_AMBIGUITY`

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
.github		.github
bin		bin
environments		environments
modules		modules
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.nf		main.nf
nextflow.config		nextflow.config

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

rsv-b-qc

Usage

Parameters

About

Uh oh!

Releases 7

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

BCCDC-PHL/rsv-b-qc

Folders and files

Latest commit

History

Repository files navigation

rsv-b-qc

Usage

Parameters

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 7

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages