GitHub

1. Overview

nf-core/dms is a reproducible, scalable, and community-curated pipeline for analyzing deep mutational scanning (DMS) data using shotgun DNA sequencing. DMS enables researchers to measure the fitness effects of thousands of gene variants simultaneously, helping to classify disease causing mutants in human and animal populations, to learn fundamental rules of virus evolution, protein architecture, splicing or small-molecule interactions.

While DNA synthesis and sequencing technologies have advanced substantially, long open reading frame (ORF) targets still present major challenges for DMS studies. Shotgun DNA sequencing can be used to greatly speed up the inference of long ORF mutant fitness landscapes, theoretically at no expense in accuracy. We have designed the nf-core/dms pipeline to unlock the power of shotgun sequencing based DMS studies, to simplify and standardise the complex bioinformatics steps involved in data processing of such experiments – from read alignment to QC reporting and fitness landscape inferences.

📄 Reference: Wehnert et al., bioRxiv preprint (coming soon)

2. Features of nf-core/dms

End-to-end analyses of DMS shotgun sequencing data
Modular, three-stage workflow: alignment → QC → error-aware fitness estimation
Integrates with popular statistical tools like DiMSum, Enrich2, Rosace and mutscan
Supports multiple mutagenesis strategies, e.g. nicking by NNK and NNS codons
Containerized via Docker, Singularity and Apptainer
Scalable across HPC and Cloud systems
Monitors CPU, memory, and CO₂ usage

3. Installation

nf-core/dms uses Nextflow, which must be installed on your system:

java -version                           # Check that Java v11+ is installed
curl -s https://get.nextflow.io | bash  # Download Nextflow
chmod +x nextflow                       # Make executable
mv nextflow ~/bin/                      # Add to user's $PATH

The pipeline itself requires no installation – Nextflow will fetch it directly from GitHub:

nextflow run nf-core/dms -profile docker

4. Usage

Prepare:

A sample sheet CSV to specify input/output labels, replicates, etc. (see example)
A reference FASTA file for the gene or region of interest

To execute nf-core/dms, run the basic command:

nextflow run nf-core/dms \
  -profile singularity,local \
  --input ./input.csv \
  --outdir ./results \
  --fasta ./ref.fa \
  --reading-frame 1-300 \
  --mutagenesis NNK-NNS \
  --seq-rarefaction false

Required parameters

Parameter	Description
`--input`	Path to sample sheet CSV
`--outdir`	Path to output directory
`--fasta`	Reference FASTA file
`--reading_frame`	Start and end nucleotide (e.g. `1-300`)

Optional parameters

Parameter	Default	Description
`--read-align`	`bwa-mem`	Read aligner
`--mutagenesis`	`NNK-NNS`	Deep mutational scanning strategy used
`--seq-rarefaction`	`false`	Estimate sequencing saturation by rarefaction
`--error-estimation`	`input`	Error model used to correct 1nt counts
`--fitness-estimation`	`dimsum`	Downstream fitness inference module

More options and advanced configuration: see vignette

5. Input Data

The primary pipeline input is a sample sheet .csv file listing:

Paths to paired-end .fastq.gz files from shotgun sequencing
Their classification as either input or output samples
Replicate IDs
Associated experimental metadata

See sample CSV for formatting.

6. Output Data

After execution, the pipeline creates the following directory structure:

results/
├── plots/               # PDF visualizations: coverage, variant heatmaps, etc.
├── intermediate_files/  # Raw alignments, filtered variant tables, QC reports
├── final_files/         # Fitness and error tables from downstream tools
├── timeline.html        # Runtime timeline
└── report.html          # Summary report incl. resource and CO₂ usage

7. Citation

If you use this pipeline in your research, please cite:

📄 Wehnert et al., bioRxiv preprint (coming soon)

Please also cite the nf-core framework:

📄 Ewels et al., Nature Biotechnology, 2020
https://doi.org/10.1038/s41587-020-0439-x

8. License

MIT License

9. Contributing

We welcome contributions from the community!

Please open an issue or pull request via this GitHub page, to:

Suggest or help implementing new modules for custom workflows
Report bugs and other challenges in running nf-core/dms
Help improve this documentation

You can also reach out to us via the nf-core Slack, by use of the #dms channel (join here).

10. Contact

For detailled scientific or technical questions, feedback and experimental discussions, feel free to contact us directly:

Benjamin Wehnert — wehnertbenjamin@gmail.com
Taylor Mighell — taylor.mighell@crg.eu
Fei Sang — fs18@sanger.ac.uk
Ben Lehner — bl11@sanger.ac.uk
Maximilian Stammnitz — maximilian.stammnitz@crg.eu

Name		Name	Last commit message	Last commit date
Latest commit History 228 Commits
.devcontainer		.devcontainer
.github		.github
.vscode		.vscode
assets		assets
conf		conf
docs		docs
modules		modules
subworkflows		subworkflows
workflows		workflows
.editorconfig		.editorconfig
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitpod.yml		.gitpod.yml
.nf-core.yml		.nf-core.yml
.pre-commit-config.yaml		.pre-commit-config.yaml
.prettierignore		.prettierignore
.prettierrc.yml		.prettierrc.yml
CHANGELOG.md		CHANGELOG.md
CITATIONS.md		CITATIONS.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
log.txt		log.txt
main.nf		main.nf
modules.json		modules.json
nextflow.config		nextflow.config
nextflow_schema.json		nextflow_schema.json
ro-crate-metadata.json		ro-crate-metadata.json
tower.yml		tower.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

1. Overview

2. Features of nf-core/dms

3. Installation

4. Usage

Required parameters

Optional parameters

5. Input Data

6. Output Data

7. Citation

8. License

9. Contributing

10. Contact

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

BenjaminWehnert1008/dmscore

Folders and files

Latest commit

History

Repository files navigation

1. Overview

2. Features of nf-core/dms

3. Installation

4. Usage

Required parameters

Optional parameters

5. Input Data

6. Output Data

7. Citation

8. License

9. Contributing

10. Contact

About

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages