Skip to content

Commit e879110

Browse files
committed
Refactored diversity and differential abundance testing
1 parent 4a94084 commit e879110

24 files changed

+2392
-948
lines changed

Amplicon/Illumina/Workflow_Documentation/NF_AmpIllumina-B/README.md

Lines changed: 8 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -104,11 +104,11 @@ For options and detailed help on how to run the workflow, run the following comm
104104
nextflow run main.nf --help
105105
```
106106
107-
> Note: Nextflow commands use both single hyphen arguments (e.g. -help) that denote general nextflow arguments and double hyphen arguments (e.g. --csv_file) that denote workflow specific parameters. Take care to use the proper number of hyphens for each argument.
107+
> Note: Nextflow commands use both single hyphen arguments (e.g. -help) that denote general nextflow arguments and double hyphen arguments (e.g. --input_file) that denote workflow specific parameters. Take care to use the proper number of hyphens for each argument.
108108
109109
<br>
110110
111-
#### 4a. Approach 1: Run slurm jobs in singularity containers with OSD accession as input
111+
#### 4a. Approach 1: Run slurm jobs in singularity containers with OSD or GLDS accession as input
112112
113113
```bash
114114
nextflow run main.nf -resume -profile slurm,singularity --accession GLDS-487 --target_region 16S
@@ -119,15 +119,15 @@ nextflow run main.nf -resume -profile slurm,singularity --accession GLDS-487 --t
119119
#### 4b. Approach 2: Run slurm jobs in singularity containers with a csv file as input
120120
121121
```bash
122-
nextflow run main.nf -resume -profile slurm,singularity --csv_file PE_file.csv --target_region 16S --F_primer AGAGTTTGATCCTGGCTCAG --R_primer CTGCCTCCCGTAGGAGT
122+
nextflow run main.nf -resume -profile slurm,singularity --input_file PE_file.csv --target_region 16S --F_primer AGAGTTTGATCCTGGCTCAG --R_primer CTGCCTCCCGTAGGAGT
123123
```
124124
125125
<br>
126126
127127
#### 4c. Approach 3: Run jobs locally in conda environments and specify the path to one or more existing conda environment(s)
128128
129129
```bash
130-
nextflow run main.nf -resume -profile conda --csv_file SE_file.csv --target_region 16S --F_primer AGAGTTTGATCCTGGCTCAG --R_primer CTGCCTCCCGTAGGAGT --conda.qc <path/to/existing/conda/environment>
130+
nextflow run main.nf -resume -profile conda --input_file SE_file.csv --target_region 16S --F_primer AGAGTTTGATCCTGGCTCAG --R_primer CTGCCTCCCGTAGGAGT --conda.qc <path/to/existing/conda/environment>
131131
```
132132
133133
<br>
@@ -145,7 +145,7 @@ nextflow run main.nf -resume -profile conda --csv_file SE_file.csv --target_regi
145145
146146
*Required only if --accession is not passed as an argument*
147147
148-
* `--csv_file` – A 3-column (single-end) or 4-column (paired-end) input csv file (sample_id, forward, [reverse,] paired). Please see the sample [SE_file.csv](workflow_code/SE_file.csv) and [PE_file.csv](workflow_code/PE_file.csv) in this repository for examples on how to format this file.
148+
* `--input_file` – A 4-column (single-end) or 5-column (paired-end) input csv file with the following headers (sample_id, forward, [reverse,] paired, groups). Please see the sample [SE_file.csv](workflow_code/SE_file.csv) and [PE_file.csv](workflow_code/PE_file.csv) in this repository for examples on how to format this file.
149149
150150
* `--F_primer` – Forward primer sequence.
151151
@@ -191,29 +191,21 @@ Standard nextflow resource usage logs are also produced as follows:
191191
For options and detailed help on how to run the post-processing workflow, run the following command:
192192
193193
```bash
194-
nextflow run post_processng.nf --help
194+
nextflow run post_processing.nf --help
195195
```
196196
197197
To generate a README file, a protocols file, a md5sums table and a file association table after running the processing workflow sucessfully, modify and set the parameters in [post_processing.config](workflow_code/post_processing.config) then run the following command:
198198
199199
```bash
200-
nextflow -C post_processing.config run post_processng.nf -resume -profile slurm,singularity
200+
nextflow -C post_processing.config run post_processing.nf -resume -profile slurm,singularity
201201
```
202202
203203
The outputs of the run will be in a directory called `Post_Processing` by default and they are as follows:
204-
205204
- Post_processing/FastQC_Outputs/filtered_multiqc_GLAmpSeq_report.zip (Filtered sequence multiqc report with paths purged)
206-
207205
- Post_processing/FastQC_Outputs/raw_multiqc_GLAmpSeq_report.zip (Raw sequence multiqc report with paths purged)
208-
209206
- Post_processing/<GLDS_accession>_-associated-file-names.tsv (File association table for curation)
210-
211207
- Post_processing/<GLDS_accession>_amplicon-validation.log (Automatic verification and validation log file)
212-
213208
- Post_processing/processed_md5sum_GLAmpSeq.tsv (md5sums for the files to be released on OSDR)
214-
215209
- Post_processing/processing_info_GLAmpSeq.zip (Zip file containing all files used to run the workflow and required logs with paths purged)
216-
217210
- Post_processing/protocol.txt (File describing the methods used by the workflow)
218-
219-
- Post_processing/README_GLAmpSeq.txt (README file listing and describing the outputs of the workflow)
211+
- Post_processing/README_GLAmpSeq.txt (README file listing and describing the outputs of the workflow)
Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,3 @@
1-
sample_id,forward,reverse,paired
2-
Sample-1,/path/to/raw-reads/Sample-1_R1_raw.fastq.gz,/path/to/raw-reads/Sample-1_R2_raw.fastq.gz,true
3-
Sample-2,/path/to/raw-reads/Sample-2_R1_raw.fastq.gz,/path/to/raw-reads/Sample-2_R2_raw.fastq.gz,true
1+
sample_id,forward,reverse,paired,groups
2+
Sample-1,/path/to/raw-reads/Sample-1_R1_raw.fastq.gz,/path/to/raw-reads/Sample-1_R2_raw.fastq.gz,true,A
3+
Sample-2,/path/to/raw-reads/Sample-2_R1_raw.fastq.gz,/path/to/raw-reads/Sample-2_R2_raw.fastq.gz,true,B
Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,3 @@
1-
sample_id,forward,paired
2-
Sample-1,/path/to/raw-reads/Sample-1_R1_raw.fastq.gz,false
3-
Sample-2,/path/to/raw-reads/Sample-2_R1_raw.fastq.gz,false
1+
sample_id,forward,paired,groups
2+
Sample-1,/path/to/raw-reads/Sample-1_R1_raw.fastq.gz,false,A
3+
Sample-2,/path/to/raw-reads/Sample-2_R1_raw.fastq.gz,false,B

0 commit comments

Comments
 (0)