Skip to content

Commit 0b756c9

Browse files
Merge pull request #65 from torres-alexis/rna-microarray-release-prep
NF_RCP-F_1.0.4, NF_MAAffymetrix_1.0.3, NF_MAAgilent1ch_1.0.2 release prep - typo and path fixes
2 parents c05a745 + f85b533 commit 0b756c9

File tree

10 files changed

+13
-13
lines changed

10 files changed

+13
-13
lines changed

GeneLab_Reference_Annotations/Pipeline_GL-DPPD-7110_Versions/GL-DPPD-7110/GL-DPPD-7110.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -217,7 +217,7 @@ string_db <- STRINGdb$new(version = "11.5", species = target_taxid, score_thresh
217217
string_map <- string_db$map(annot, primary_keytype, removeUnmappedRows = FALSE, takeFirst = FALSE)
218218

219219
## Create a table using the gene IDs of the primary keytype as row names and a column containing STRING IDs. ##
220-
## For genes containig multiple STRING IDs, combine all STRING IDs for each gene into one row and separate each ID with a '|' ##
220+
## For genes containing multiple STRING IDs, combine all STRING IDs for each gene into one row and separate each ID with a '|' ##
221221
tab_with_multiple_STRINGids_combined <-
222222
data.frame(row.names = annot[[primary_keytype]])
223223

@@ -256,7 +256,7 @@ pthOrganisms(PANTHER.db) <- target_organism
256256
## Use ENTREZ IDs to map genes to respective PANTHER GO slim annotation(s) ##
257257
# Note: Since there can be none (indicated in the annotation table as "NA"), one, or
258258
# multiple ENTREZ IDs for a gene, this section contains 3 distinct parts to handle
259-
# each of those scenarios and create a new column in the annotation table containg the GO slim IDs
259+
# each of those scenarios and create a new column in the annotation table containing the GO slim IDs
260260

261261
for ( curr_row in 1:dim(annot)[1] ) {
262262

GeneLab_Reference_Annotations/Workflow_Documentation/GL_RefAnnotTable/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -71,7 +71,7 @@ Rscript GL-DPPD-7110_build-genome-annots-tab.R MOUSE
7171

7272
**Input data:**
7373

74-
- No input files required, but a target organism must be specified as a positional command line argument, `MOUSE` is used in the example above. Run `Rscript GL-DPPD-7110_build-genome-annots-tab.R` with no positional arugments to see the list of currently available organisms.
74+
- No input files required, but a target organism must be specified as a positional command line argument, `MOUSE` is used in the example above. Run `Rscript GL-DPPD-7110_build-genome-annots-tab.R` with no positional arguments to see the list of currently available organisms.
7575

7676
**Output data:**
7777

GeneLab_Reference_Annotations/Workflow_Documentation/GL_RefAnnotTable/workflow_code/GL-DPPD-7110_build-genome-annots-tab.R

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -263,7 +263,7 @@ string_map <- string_db$map(annot, primary_keytype, removeUnmappedRows = FALSE,
263263
## Adding some blank lines just for spacing on print-out ##
264264
cat("\n\n")
265265

266-
## Create a table using the gene IDs of the primary keytype as row names and a column containing STRING IDs. For genes containig multiple STRING IDs, combine all STRING IDs for each gene into one row and separate each ID with a '|' ##
266+
## Create a table using the gene IDs of the primary keytype as row names and a column containing STRING IDs. For genes containing multiple STRING IDs, combine all STRING IDs for each gene into one row and separate each ID with a '|' ##
267267

268268
tab_with_multiple_STRINGids_combined <-
269269
data.frame(row.names = annot[[primary_keytype]])
@@ -303,7 +303,7 @@ pthOrganisms(PANTHER.db) <- target_organism
303303

304304
## Use ENTREZ IDs to map genes to respective PANTHER GO slim annotation(s) ##
305305

306-
## Note: Since there can be none (indicated in the annotation table as "NA"), one, or multiple ENTREZ IDs for a gene, this section contains 3 distinct parts to handle each of those scenarios and create a new column in the annotation table containg the GO slim IDs ##
306+
## Note: Since there can be none (indicated in the annotation table as "NA"), one, or multiple ENTREZ IDs for a gene, this section contains 3 distinct parts to handle each of those scenarios and create a new column in the annotation table containing the GO slim IDs ##
307307

308308
for ( curr_row in 1:dim(annot)[1] ) {
309309

Microarray/Affymetrix/Workflow_Documentation/NF_MAAffymetrix/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -97,7 +97,7 @@ All files required for utilizing the NF_MAAffymetrix GeneLab workflow for proces
9797
copy of latest NF_MAAffymetrix version on to your system, the code can be downloaded as a zip file from the release page then unzipped after downloading by running the following commands:
9898
9999
```bash
100-
wget https://github.com/asaravia-butler/GeneLab_Data_Processing/releases/download/NF_MAAffymetrix_1.0.3/NF_MAAffymetrix_1.0.3.zip
100+
wget https://github.com/nasa/GeneLab_Data_Processing/releases/download/NF_MAAffymetrix_1.0.3/NF_MAAffymetrix_1.0.3.zip
101101
102102
unzip NF_MAAffymetrix_1.0.3.zip
103103
```

Microarray/Affymetrix/Workflow_Documentation/NF_MAAffymetrix/workflow_code/modules/POST_PROCESSING/GENERATE_PROTOCOL/resources/usr/bin/generate_protocol.sh

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -58,7 +58,7 @@ else
5858
fi
5959

6060
# Read the template file
61-
template="Data were processed as described in GL-DPPD-7114 (https://github.com/nasa/GeneLab_Data_Processing/blob/master/Microarray/Affymetrix/Pipeline_GL-DPPD-7114_Versions/GL-DPPD-7114.md) using NF_MAAffymetrix version 1.0.3 (GitHub link coming soon). In short, a RunSheet containing raw data file location and processing metadata from the study's *ISA.zip file was generated using dp_tools (version ${dp_tools_VERSION}). The raw array data files were loaded into R (version ${R_VERSION}) using oligo (version ${oligo_VERSION}). Raw data quality assurance density plot, pseudo images, MA plots, and boxplots were generated using oligo (version ${oligo_VERSION}). The raw probe level intensity data was background corrected and normalized across arrays via the oligo (version ${oligo_VERSION}) quantile method. Normalized probe level data quality assurance density plot, pseudo images, MA plots, and boxplots were generated using oligo (version ${oligo_VERSION}). Normalized probe level data was summarized to the probeset level using the oligo (version ${oligo_VERSION}) RMA method. ${GENE_MAPPING_STEP} Differential expression analysis was performed in R (version ${R_VERSION}) using limma (version ${limma_VERSION}); all groups were compared pairwise for each probeset to generate a moderated t-statistic and associated p- and adjusted p-value. Gene annotations were assigned for every probeset that mapped to exactly one Ensembl gene ID using the custom annotation tables generated in-house as detailed in GL-DPPD-7110 (https://github.com/nasa/GeneLab_Data_Processing/blob/GL_RefAnnotTable_1.0.0/GeneLab_Reference_Annotations/Pipeline_GL-DPPD-7110_Versions/GL-DPPD-7110/GL-DPPD-7110.md), with STRINGdb (version 2.8.4), PANTHER.db (version 1.0.11), and ${GENE_ANNOTATION_DB} (version 3.15.0)."
61+
template="Data were processed as described in GL-DPPD-7114 (https://github.com/nasa/GeneLab_Data_Processing/blob/master/Microarray/Affymetrix/Pipeline_GL-DPPD-7114_Versions/GL-DPPD-7114.md) using NF_MAAffymetrix version 1.0.3 (https://github.com/nasa/GeneLab_Data_Processing/tree/NF_MAAffymetrix_1.0.3/Microarray/Affymetrix/Workflow_Documentation/NF_MAAffymetrix). In short, a RunSheet containing raw data file location and processing metadata from the study's *ISA.zip file was generated using dp_tools (version ${dp_tools_VERSION}). The raw array data files were loaded into R (version ${R_VERSION}) using oligo (version ${oligo_VERSION}). Raw data quality assurance density plot, pseudo images, MA plots, and boxplots were generated using oligo (version ${oligo_VERSION}). The raw probe level intensity data was background corrected and normalized across arrays via the oligo (version ${oligo_VERSION}) quantile method. Normalized probe level data quality assurance density plot, pseudo images, MA plots, and boxplots were generated using oligo (version ${oligo_VERSION}). Normalized probe level data was summarized to the probeset level using the oligo (version ${oligo_VERSION}) RMA method. ${GENE_MAPPING_STEP} Differential expression analysis was performed in R (version ${R_VERSION}) using limma (version ${limma_VERSION}); all groups were compared pairwise for each probeset to generate a moderated t-statistic and associated p- and adjusted p-value. Gene annotations were assigned for every probeset that mapped to exactly one Ensembl gene ID using the custom annotation tables generated in-house as detailed in GL-DPPD-7110 (https://github.com/nasa/GeneLab_Data_Processing/blob/GL_RefAnnotTable_1.0.0/GeneLab_Reference_Annotations/Pipeline_GL-DPPD-7110_Versions/GL-DPPD-7110/GL-DPPD-7110.md), with STRINGdb (version 2.8.4), PANTHER.db (version 1.0.11), and ${GENE_ANNOTATION_DB} (version 3.15.0)."
6262

6363
# Output the filled template
6464
echo "$template" > PROTOCOL_GLmicroarray.txt

Microarray/Affymetrix/Workflow_Documentation/NF_MAAffymetrix/workflow_code/nextflow.config

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -40,8 +40,8 @@ profiles {
4040
}
4141

4242
manifest {
43-
homePage = ''
44-
description = 'Affymetrx Microarray Workflow for Document GL-DPPD-7114'
43+
homePage = 'https://github.com/nasa/GeneLab_Data_Processing/tree/master/Microarray/Affymetrix'
44+
description = 'Affymetrix Microarray Workflow for Document GL-DPPD-7114'
4545
mainScript = 'main.nf'
4646
defaultBranch = 'main'
4747
nextflowVersion = '>=22.10.0'

Microarray/Agilent_1-channel/Workflow_Documentation/NF_MAAgilent1ch/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -93,7 +93,7 @@ We recommend installing Singularity on a system wide level as per the associated
9393
All files required for utilizing the NF_MAAgilent1ch GeneLab workflow for processing Agilent 1 Channel Microarray data are in the [workflow_code](workflow_code) directory. To get a copy of latest NF_MAAgilent1ch version on to your system, the code can be downloaded as a zip file from the release page then unzipped after downloading by running the following commands:
9494
9595
```bash
96-
wget https://github.com/asaravia-butler/GeneLab_Data_Processing/releases/download/NF_MAAgilent1ch_1.0.2/NF_MAAgilent1ch_1.0.2.zip
96+
wget https://github.com/nasa/GeneLab_Data_Processing/releases/download/NF_MAAgilent1ch_1.0.2/NF_MAAgilent1ch_1.0.2.zip
9797
9898
unzip NF_MAAgilent1ch_1.0.2.zip
9999
```

Microarray/Agilent_1-channel/Workflow_Documentation/NF_MAAgilent1ch/workflow_code/nextflow.config

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -40,7 +40,7 @@ profiles {
4040
}
4141

4242
manifest {
43-
homePage = ''
43+
homePage = 'https://github.com/nasa/GeneLab_Data_Processing/tree/master/Microarray/Agilent_1-channel'
4444
description = 'Agilent 1 Channel Microarray Workflow for Document GL-DPPD-7112'
4545
mainScript = 'main.nf'
4646
defaultBranch = 'main'

RNAseq/Workflow_Documentation/NF_RCP-F/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -101,7 +101,7 @@ All files required for utilizing the NF_RCP-F GeneLab workflow for processing RN
101101
copy of latest NF_RCP-F version on to your system, the code can be downloaded as a zip file from the release page then unzipped after downloading by running the following commands:
102102
103103
```bash
104-
wget https://github.com/nasa/GeneLab_Data_Processing/releases/download/NF_RCP-F_4/NF_RCP-F_1.0.4.zip
104+
wget https://github.com/nasa/GeneLab_Data_Processing/releases/download/NF_RCP-F_1.0.4/NF_RCP-F_1.0.4.zip
105105
106106
unzip NF_RCP-F_1.0.4.zip
107107
```

RNAseq/Workflow_Documentation/NF_RCP-F/workflow_code/nextflow.config

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -47,7 +47,7 @@ profiles {
4747
}
4848

4949
manifest {
50-
homePage = 'https://github.com/nasa/GeneLab_Data_Processing/tree/jonathan-branch/RNAseq'
50+
homePage = 'https://github.com/nasa/GeneLab_Data_Processing/tree/master/RNAseq'
5151
description = 'RNA-Seq Pipeline for Document GL-DPPD-7101-F'
5252
mainScript = 'main.nf'
5353
defaultBranch = 'main'

0 commit comments

Comments
 (0)