Skip to content

Commit c750589

Browse files
committed
fixed README typo
1 parent 6409cbf commit c750589

File tree

1 file changed

+15
-14
lines changed
  • Metagenomics/Illumina/Workflow_Documentation/NF_MGIllumina

1 file changed

+15
-14
lines changed

Metagenomics/Illumina/Workflow_Documentation/NF_MGIllumina/README.md

Lines changed: 15 additions & 14 deletions
Original file line numberDiff line numberDiff line change
@@ -3,12 +3,12 @@
33
## General Workflow Info
44

55
### Implementation Tools
6+
67
The current GeneLab Illumina metagenomics sequencing data processing pipeline (MGIllumina), [GL-DPPD-7107-A.md](../../Pipeline_GL-DPPD-7107_Versions/GL-DPPD-7107-A.md), is implemented as a [Nextflow](https://nextflow.io/) DSL2 workflow and utilizes [Singularity](https://docs.sylabs.io/guides/3.10/user-guide/introduction.html) containers or [conda](https://docs.conda.io/en/latest/) environments to install/run all tools. This workflow is run using the command line interface (CLI) of any unix-based system. While knowledge of creating workflows in nextflow is not required to run the workflow as is, [the Nextflow documentation](https://nextflow.io/docs/latest/index.html) is a useful resource for users who want to modify and/or extend this workflow.
78

89
> **Note on reference databases**
910
> Many reference databases are relied upon throughout this workflow. They will be installed and setup automatically the first time the workflow is run. All together, after installed and unpacked, they will take up about about 340 GB of storage, but they may also require up to 500GB during installation and initial un-packing, so be sure there is enough room on your system before running the workflow.
1011
11-
1212
## Utilizing the Workflow
1313

1414
1. [Install Nextflow and Singularity](#1-install-nextflow-and-singularity)
@@ -19,21 +19,21 @@ The current GeneLab Illumina metagenomics sequencing data processing pipeline (M
1919

2020
3. [Fetch Singularity Images](#3-fetch-singularity-images)
2121

22-
4. [Run the workflow](#3-run-the-workflow)
23-
3a. [Approach 1: Run slurm jobs in singularity containers with OSD accession as input](#3a-approach-1-run-slurm-jobs-in-singularity-containers-with-osd-accession-as-input)
24-
3b. [Approach 2: Run slurm jobs in singularity containers with a csv file as input](#3b-approach-2-run-slurm-jobs-in-singularity-containers-with-a-csv-file-as-input)
25-
3c. [Approach 3: Run jobs locally in conda environments and specify the path to one or more existing conda environments](#3c-approach-run-jobs-locally-in-conda-environments-and-specify-the-path-to-one-or-more-existing-conda-environments)
26-
3d. [Modify parameters and cpu resources in the nextflow config file](#3d-modify-parameters-and-cpu-resources-in-the-nextflow-config-file)
22+
4. [Run the workflow](#4-run-the-workflow)
23+
3a. [Approach 1: Run slurm jobs in singularity containers with OSD accession as input](#4a-approach-1-run-slurm-jobs-in-singularity-containers-with-osd-accession-as-input)
24+
3b. [Approach 2: Run slurm jobs in singularity containers with a csv file as input](#4b-approach-2-run-slurm-jobs-in-singularity-containers-with-a-csv-file-as-input)
25+
3c. [Approach 3: Run jobs locally in conda environments and specify the path to one or more existing conda environments](#4c-approach-run-jobs-locally-in-conda-environments-and-specify-the-path-to-one-or-more-existing-conda-environments)
26+
3d. [Modify parameters and cpu resources in the nextflow config file](#4d-modify-parameters-and-cpu-resources-in-the-nextflow-config-file)
2727

28-
4. [Workflow outputs](#4-workflow-outputs)
29-
4a. [Main outputs](#4a-main-outputs)
30-
4b. [Resource logs](#4b-resource-logs)
28+
5. [Workflow outputs](#5-workflow-outputs)
29+
4a. [Main outputs](#5a-main-outputs)
30+
4b. [Resource logs](#5b-resource-logs)
3131

3232
<br>
3333

3434
---
3535

36-
### 1. Install Nextflow and Singularity
36+
### 1. Install Nextflow and Singularity
3737

3838
#### 1a. Install Nextflow
3939

@@ -70,6 +70,7 @@ All files required for utilizing the NF_XXX GeneLab workflow for processing meta
7070
wget https://github.com/nasa/GeneLab_Data_Processing/releases/download/NF_MGIllumina/NF_MGIllumina.zip
7171
unzip NF_MGIllumina.zip && cd NF_XXX-X_X.X.X
7272
```
73+
7374
<br>
7475
7576
---
@@ -79,13 +80,13 @@ unzip NF_MGIllumina.zip && cd NF_XXX-X_X.X.X
7980
Although Nextflow can fetch Singularity images from a url, doing so may cause issues as detailed [here](https://github.com/nextflow-io/nextflow/issues/1210).
8081
8182
To avoid this issue, run the following command to fetch the Singularity images prior to running the NF_MGIllumina workflow:
83+
8284
> Note: This command should be run in the location containing the `NF_MGIllumina` directory that was downloaded in [step 2](#2-download-the-workflow-files) above.
8385
8486
```bash
8587
bash ./bin/prepull_singularity.sh nextflow.config
8688
```
8789
88-
8990
Once complete, a `singularity` folder containing the Singularity images will be created. Run the following command to export this folder as a Nextflow configuration environment variable to ensure Nextflow can locate the fetched images:
9091
9192
```bash
@@ -135,17 +136,17 @@ nextflow run main.nf -resume -profile conda --csv_file SE_file.csv --conda.qc <p
135136
**Required Parameters For All Approaches:**
136137
137138
* `-run main.nf` - Instructs nextflow to run the NF_XXX workflow
139+
138140
* `-resume` - Resumes workflow execution using previously cached results
139-
* `-profile` – Specifies the configuration profile(s) to load, `singularity` instructs nextflow to setup and use singularity for all software called in the workflow
140-
141141
142+
* `-profile` – Specifies the configuration profile(s) to load, `singularity` instructs nextflow to setup and use singularity for all software called in the workflow
142143
*Required only if you would like to pull and process data directly from OSDR*
143144
144145
* `--GLDS_accession` – A Genelab / OSD accession number e.g. OSD-574.
145146
146147
*Required only if --GLDS_accession is not passed as an argument*
147148
148-
* `--csv_file` – A 3-column (single-end) or 4-column (paired-end) input csv file (sample_id, forward, [reverse,] paired). Please see the sample [SE_file.csv](workflow_code/SE_file.csv)and [PE_file.csv]((workflow_code/PE_file.csv) in this repository for examples on how to format this file.
149+
* `--csv_file` – A 3-column (single-end) or 4-column (paired-end) input csv file (sample_id, forward, [reverse,] paired). Please see the sample [SE_file.csv](workflow_code/SE_file.csv)and [PE_file.csv](workflow_code/PE_file.csv) in this repository for examples on how to format this file.
149150
150151
> See `nextflow run -h` and [Nextflow's CLI run command documentation](https://nextflow.io/docs/latest/cli.html#run) for more options and details on how to run nextflow.
151152

0 commit comments

Comments
 (0)