Skip to content

Commit 8c993dc

Browse files
Merge pull request #51 from torres-alexis/dppd-multiqc-update
Document updates - typo fixes - plot size corrections - specifying the zip command for multiqc reports to match what's published on the OSDR
2 parents 60a56d6 + 2789101 commit 8c993dc

File tree

3 files changed

+45
-30
lines changed

3 files changed

+45
-30
lines changed

Microarray/Affymetrix/Pipeline_GL-DPPD-7114_Versions/GL-DPPD-7114.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -65,7 +65,7 @@ Lauren Sanders (acting GeneLab Project Scientist)
6565
|biomaRt|2.50.0|[https://bioconductor.org/packages/3.14/bioc/html/biomaRt.html](https://bioconductor.org/packages/3.14/bioc/html/biomaRt.html)|
6666
|matrixStats|0.63.0|[https://github.com/HenrikBengtsson/matrixStats](https://github.com/HenrikBengtsson/matrixStats)|
6767
|statmod|1.5.0|[https://github.com/cran/statmod](https://github.com/cran/statmod)|
68-
|dp_tools|1.3.2|[https://github.com/J-81/dp_tools](https://github.com/J-81/dp_tools)|
68+
|dp_tools|1.3.4|[https://github.com/J-81/dp_tools](https://github.com/J-81/dp_tools)|
6969
|singularity|3.9|[https://sylabs.io](https://sylabs.io)|
7070
|Quarto|1.1.251|[https://quarto.org](https://quarto.org)|
7171

@@ -292,7 +292,7 @@ legend("topright", legend = colnames(raw_data@assayData$exprs),
292292
lty = c(1,2,3,4,5), # Seems like oligo::hist cycles through these first five line types
293293
col = oligo::darkColors(n = ncol(raw_data)), # Ensure legend color is in sync with plot
294294
ncol = number_of_sets, # Set number of columns by number of sets
295-
cex = 1 + 0.2 - (number_of_sets*0.2) # Reduce scale by 20% for each column beyond 1
295+
cex = max(0.35, 1 + 0.2 - (number_of_sets*0.2)) # Reduce scale by 20% for each column beyond 1 with minimum of 35%
296296
)
297297

298298
# Reset par
@@ -475,7 +475,7 @@ legend("topright", legend = colnames(norm_data@assayData$exprs),
475475
lty = c(1,2,3,4,5), # Seems like oligo::hist cycles through these first five line types
476476
col = oligo::darkColors(n = ncol(norm_data)), # Ensure legend color is in sync with plot
477477
ncol = number_of_sets, # Set number of columns by number of sets
478-
cex = 1 + 0.2 - (number_of_sets*0.2) # Reduce scale by 20% for each column beyond 1
478+
cex = max(0.35, 1 + 0.2 - (number_of_sets*0.2)) # Reduce scale by 20% for each column beyond 1 with minimum of 35%
479479
)
480480

481481
# Reset par
@@ -1219,4 +1219,4 @@ write.csv(norm_data_matrix_annotated, file.path(DIR_NORMALIZED_EXPRESSION, "norm
12191219
- **raw_intensities_probe.csv** (table containing the background corrected, unnormalized probe intensity values for each sample including gene annotations. The ProbeID is the unique index column.)
12201220
- **normalized_intensities_probe.csv** (table containing the background corrected, normalized probe intensity values for each sample including gene annotations. The ProbeID is the unique index column.)
12211221

1222-
> All steps of the Microarray pipeline are performed using R markdown and the completed R markdown is rendered (via Quarto) as an html file (**NF_MAAffymetrix_\*.html**) and published in the [Open Science Data Repository (OSDR)](https://osdr.nasa.gov/bio/repo/) for the respective dataset.
1222+
> All steps of the Microarray pipeline are performed using R markdown and the completed R markdown is rendered (via Quarto) as an html file (**NF_MAAffymetrix_\*.html**) and published in the [Open Science Data Repository (OSDR)](https://osdr.nasa.gov/bio/repo/) for the respective dataset.

RNAseq/Pipeline_GL-DPPD-7101_Versions/GL-DPPD-7101-F.md

Lines changed: 40 additions & 25 deletions
Original file line numberDiff line numberDiff line change
@@ -168,7 +168,8 @@ fastqc -o /path/to/raw_fastqc/output/directory *.fastq.gz
168168
### 1b. Compile Raw Data QC
169169

170170
```bash
171-
multiqc --interactive -n raw_multiqc -o /path/to/raw_multiqc/output/directory /path/to/directory/containing/raw_fastqc/files
171+
multiqc --interactive -n raw_multiqc -o /path/to/raw_multiqc/output/raw_multiqc_report /path/to/directory/containing/raw_fastqc/files
172+
zip -r raw_multiqc_report.zip /path/to/raw_multiqc/output/raw_multiqc_report
172173
```
173174

174175
**Parameter Definitions:**
@@ -184,8 +185,9 @@ multiqc --interactive -n raw_multiqc -o /path/to/raw_multiqc/output/directory /p
184185

185186
**Output Data:**
186187

187-
- **raw_multiqc.html** (multiqc report)
188-
- **/raw_multiqc_data** (directory containing multiqc data)
188+
* **raw_multiqc_report.zip** (zip containing the following)
189+
* **raw_multiqc.html** (multiqc output html summary)
190+
* **raw_multiqc_data** (directory containing multiqc output data)
189191

190192
<br>
191193

@@ -257,7 +259,8 @@ fastqc -o /path/to/trimmed_fastqc/output/directory *.fastq.gz
257259
### 2c. Compile Trimmed Data QC
258260

259261
```bash
260-
multiqc --interactive -n trimmed_multiqc -o /path/to/trimmed_multiqc/output/directory /path/to/directory/containing/trimmed_fastqc/files
262+
multiqc --interactive -n trimmed_multiqc -o /path/to/trimmed_multiqc/output/trimmed_multiqc_report /path/to/directory/containing/trimmed_fastqc/files
263+
zip -r trimmed_multiqc_report.zip /path/to/trimmed_multiqc/output/trimmed_multiqc_report
261264
```
262265

263266
**Parameter Definitions:**
@@ -273,8 +276,9 @@ multiqc --interactive -n trimmed_multiqc -o /path/to/trimmed_multiqc/output/dire
273276

274277
**Output Data:**
275278

276-
- **trimmed_multiqc.html** (multiqc report)
277-
- **/trimmed_multiqc_data** (directory containing multiqc data)
279+
* **trimmed_multiqc_report.zip** (zip containing the following)
280+
* **trimmed_multiqc.html** (multiqc output html summary)
281+
* **trimmed_multiqc_data** (directory containing multiqc output data)
278282

279283
<br>
280284

@@ -420,7 +424,8 @@ STAR --twopassMode Basic \
420424
### 4b. Compile Alignment Logs
421425

422426
```bash
423-
multiqc --interactive -n align_multiqc -o /path/to/aligned_multiqc/output/directory /path/to/*Log.final.out/files
427+
multiqc --interactive -n align_multiqc -o /path/to/align_multiqc/output/align_multiqc_report /path/to/*Log.final.out/files
428+
zip -r align_multiqc_report.zip /path/to/align_multiqc/output/align_multiqc_report
424429
```
425430

426431
**Parameter Definitions:**
@@ -436,8 +441,9 @@ multiqc --interactive -n align_multiqc -o /path/to/aligned_multiqc/output/direct
436441

437442
**Output Data:**
438443

439-
- **align_multiqc.html** (multiqc report)
440-
- **/align_multiqc_data** (directory containing multiqc data)
444+
* **align_multiqc_report.zip** (zip containing the following)
445+
* **align_multiqc.html** (multiqc output html summary)
446+
* **align_multiqc_data** (directory containing multiqc output data)
441447

442448
<br>
443449

@@ -628,7 +634,8 @@ infer_experiment.py -r /path/to/annotation/BED/file \
628634
### 6b. Compile Strandedness Reports
629635

630636
```bash
631-
multiqc --interactive -n infer_exp_multiqc -o /path/to/infer_exp_multiqc/output/directory /path/to/*infer_expt.out/files
637+
multiqc --interactive -n infer_exp_multiqc -o /path/to/infer_exp_multiqc/output/infer_exp_multiqc /path/to/*infer_expt.out/files
638+
zip -r infer_exp_multiqc_report.zip /path/to/infer_exp_multiqc/output/infer_exp_multiqc_report
632639
```
633640

634641
**Parameter Definitions:**
@@ -644,8 +651,9 @@ multiqc --interactive -n infer_exp_multiqc -o /path/to/infer_exp_multiqc/output/
644651

645652
**Output Data:**
646653

647-
- **infer_exp_multiqc.html** (multiqc report)
648-
- **/infer_exp_multiqc_data** (directory containing multiqc data)
654+
* **infer_exp_multiqc_report.zip** (zip containing the following)
655+
* **infer_exp_multiqc.html** (multiqc output html summary)
656+
* **infer_exp_multiqc_data** (directory containing multiqc output data)
649657

650658
<br>
651659

@@ -681,7 +689,8 @@ geneBody_coverage.py -r /path/to/annotation/BED/file \
681689
### 6d. Compile GeneBody Coverage Reports
682690

683691
```bash
684-
multiqc --interactive -n genebody_cov_multiqc -o /path/to/geneBody_coverage_multiqc/output/directory /path/to/geneBody_coverage/output/files
692+
multiqc --interactive -n genebody_cov_multiqc -o /path/to/geneBody_cov_multiqc/output/geneBody_cov_multiqc_report /path/to/geneBody_coverage/output/files
693+
zip -r genebody_cov_multiqc_report.zip /path/to/genebody_cov_multiqc/output/genebody_cov_multiqc_report
685694
```
686695

687696
**Parameter Definitions:**
@@ -697,8 +706,9 @@ multiqc --interactive -n genebody_cov_multiqc -o /path/to/geneBody_coverage_mult
697706

698707
**Output Data:**
699708

700-
- **geneBody_cov_multiqc.html** (multiqc report)
701-
- **/geneBody_cov_multiqc_data** (directory containing multiqc data)
709+
* **genebody_cov_multiqc_report.zip** (zip containing the following)
710+
* **genebody_cov_multiqc.html** (multiqc output html summary)
711+
* **genebody_cov_multiqc_data** (directory containing multiqc output data)
702712

703713
<br>
704714

@@ -740,7 +750,8 @@ inner_distance.py -r /path/to/annotation/BED/file \
740750
### 6f. Compile Inner Distance Reports
741751

742752
```bash
743-
multiqc --interactive -n inner_dist_multiqc /path/to/inner_dist_multiqc/output/directory /path/to/inner_dist/output/files
753+
multiqc --interactive -n inner_dist_multiqc /path/to/align_multiqc/output/inner_dist_multiqc_report /path/to/inner_dist/output/files
754+
zip -r inner_dist_multiqc_report.zip /path/to/align_multiqc/output/inner_dist_multiqc_report
744755
```
745756

746757
**Parameter Definitions:**
@@ -756,8 +767,9 @@ multiqc --interactive -n inner_dist_multiqc /path/to/inner_dist_multiqc/output/d
756767

757768
**Output Data:**
758769

759-
- **inner_distance_multiqc.html** (multiqc report)
760-
- **/inner_distance_multiqc_data** (directory containing multiqc data)
770+
* **inner_dist_multiqc_report.zip** (zip containing the following)
771+
* **inner_dist_multiqc.html** (multiqc output html summary)
772+
* **inner_dist_multiqc_data** (directory containing multiqc output data)
761773

762774
<br>
763775

@@ -790,7 +802,8 @@ read_distribution.py -r /path/to/annotation/BED/file \
790802
### 6h. Compile Read Distribution Reports
791803

792804
```bash
793-
multiqc --interactive -n read_dist_multiqc -o /path/to/read_dist_multiqc/output/directory /path/to/*read_dist.out/files
805+
multiqc --interactive -n read_dist_multiqc -o /path/to/read_dist_multiqc/output/read_dist_multiqc_report /path/to/*read_dist.out/files
806+
zip -r read_dist_multiqc_report.zip /path/to/read_dist_multiqc/output/read_dist_multiqc_report
794807
```
795808

796809
**Parameter Definitions:**
@@ -806,8 +819,9 @@ multiqc --interactive -n read_dist_multiqc -o /path/to/read_dist_multiqc/output/
806819

807820
**Output Data:**
808821

809-
- **read_dist_multiqc.html** (multiqc report)
810-
- **/read_dist_multiqc_data** (directory containing multiqc data)
822+
* **read_dist_multiqc_report.zip** (zip containing the following)
823+
* **read_dist_multiqc.html** (multiqc output html summary)
824+
* **read_dist_multiqc_data** (directory containing multiqc output data)
811825

812826
<br>
813827

@@ -819,7 +833,6 @@ multiqc --interactive -n read_dist_multiqc -o /path/to/read_dist_multiqc/output/
819833
rsem-prepare-reference --gtf /path/to/annotation/gtf/file \
820834
/path/to/genome/fasta/file \
821835
/path/to/RSEM/genome/directory/RSEM_ref_prefix
822-
823836
```
824837

825838
**Parameter Definitions:**
@@ -904,7 +917,8 @@ rsem-calculate-expression --num-threads NumberOfThreads \
904917
### 8b. Compile RSEM Count Logs
905918

906919
```bash
907-
multiqc --interactive -n RSEM_count_multiqc -o /path/to/RSEM_count_multiqc/output/directory /path/to/*stat/files
920+
multiqc --interactive -n RSEM_count_multiqc -o /path/to/RSEM_count_multiqc/output/RSEM_count_multiqc_report /path/to/*stat/files
921+
zip -r RSEM_count_multiqc_report.zip /path/to/raw_multiqc/output/RSEM_count_multiqc_report
908922
```
909923

910924
**Parameter Definitions:**
@@ -923,8 +937,9 @@ multiqc --interactive -n RSEM_count_multiqc -o /path/to/RSEM_count_multiqc/outpu
923937

924938
**Output Data:**
925939

926-
- **RSEM_count_multiqc.html** (multiqc report)
927-
- **/RSEM_count_multiqc_data** (directory containing multiqc data)
940+
* **RSEM_count_multiqc_report.zip** (zip containing the following)
941+
* **RSEM_count_multiqc.html** (multiqc output html summary)
942+
* **RSEM_count_multiqc_data** (directory containing multiqc output data)
928943

929944
<br>
930945

Raw_Data_Generation/UPX_kit_for_bulk_RNAseq/Pipeline_GL-DPPD-7108_Versions/GL-DPPD-7108.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@
22

33
> **This page holds an overview and instructions for how GeneLab generates raw (bulk) RNA sequence data from libraries prepared [in-house](https://genelab.nasa.gov/genelab-sequencing-services/omics_data) using the [Qiagen UPX kit](https://www.qiagen.com/us/products/discovery-and-translational-research/next-generation-sequencing/rna-sequencing/three-rnaseq/qiaseq-upx-3-transcriptome-kits/). Raw data output files and processing protocol are provided for each GeneLab-generated GLDS dataset hosted in the [Open Science Data Repository (OSDR)](https://osdr.nasa.gov/bio/repo/).
44
> The instructions below assume that the samples were prepared as follows:**
5-
> 1. Bulk RNA was was extracted from samples
5+
> 1. Bulk RNA was extracted from samples
66
> 2. Each sample was prepared with a unique cell ID
77
> 3. Samples were pooled together to create sample pools
88
> 4. Each sample pool was prepared with a unique single index

0 commit comments

Comments
 (0)