You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
-[Step 2: Fetch the Singularity Image](#step-2-fetch-the-singularity-image)
12
12
-[Step 3: Run the Workflow](#step-3-run-the-workflow)
13
-
-[Optional: Run the Annotations Database Creation Function as a Stand-Alone Script via Singularity](#optional-run-the-annotations-database-creation-function-as-a-stand-alone-script-via-singularity)
14
13
-[Approach 2: Using a Local R Environment](#approach-2-using-a-local-r-environment)
15
14
-[Step 1: Install R and Required R Packages](#step-1-install-r-and-required-r-packages)
16
15
-[Step 2: Run the Workflow](#step-2-run-the-workflow)
17
-
-[Optional: Run the Annotations Database Creation Function as a Stand-Alone Script via R](#optional-run-the-annotations-database-creation-function-as-a-stand-alone-script-via-r)
-[3. Run the Annotations Database Creation Function as a Stand-Alone Script](#3-run-the-annotations-database-creation-function-as-a-stand-alone-script)
18
+
-[Using Singularity](#using-singularity)
19
+
-[Using a Local R Environment](#using-a-local-r-environment)
18
20
19
21
<br>
20
22
@@ -93,62 +95,19 @@ Once complete, a `singularity` folder containing the Singularity images will be
93
95
```bash
94
96
export SINGULARITY_CACHEDIR=$(pwd)/singularity
95
97
```
98
+
96
99
<br>
97
100
98
101
#### Step 3: Run the Workflow
99
102
100
-
While in the directory containing the `GL_RefAnnotTable-A_1.1.0` folder, you can now run the workflow. Below is an example for generating the annotation table for *Mus musculus* (mouse):
103
+
While in the directory containing the `GL_RefAnnotTable-A_1.1.0` folder that was downloaded in [step 1](#1-download-the-workflow-files), you can now run the workflow. Below is an example for generating the annotation table for *Mus musculus* (mouse):
- No input files are required. Specify the species name of the target organism using a positional command line argument. `Mus musculus` is used in the example above.
113
-
> **Notes**:
114
-
> - To see a list of all available organisms, run `Rscript GL-DPPD-7110-A_build-genome-annots-tab.R` without positional arguments.
115
-
> - The correct argument for each organism can also be found in the 'species' column of the [GL-DPPD-7110-A_annotations.csv](../../Pipeline_GL-DPPD-7110_Versions/GL-DPPD-7110-A/GL-DPPD-7110-A_annotations.csv)
116
-
117
-
-*Optional*: A local reference table CSV file can be supplied as a second positional argument. If not provided, the script will download the current version of the [GL-DPPD-7110-A_annotations.csv](../../Pipeline_GL-DPPD-7110_Versions/GL-DPPD-7110-A/GL-DPPD-7110-A_annotations.csv) table by default.
118
-
119
-
120
-
**Output data:**
121
-
122
-
-*-GL-annotations.tsv (Tab delineated table of gene annotations)
123
-
124
-
-*-GL-build-info.txt (Text file containing information used to create the annotation table, including tool and tool versions and date of creation)
125
-
126
-
<br>
127
-
128
-
#### *Optional*: Run the Annotations Database Creation Function as a Stand-Alone Script via Singularity
129
-
130
-
If the reference table does not specify an annotations database for the target organism in the 'annotations' column of the [GL-DPPD-7110-A_annotations.csv](../../Pipeline_GL-DPPD-7110_Versions/GL-DPPD-7110-A/GL-DPPD-7110-A_annotations.csv) file, the `install_annotations` function (defined in `install-org-db.R`) will be executed by default. This function can also be run as a stand-alone script:
- The species name of the target organism must be specified as the first positional command line argument. `Bacillus subtilis` is used in the example above.
144
-
> **Note**: The correct argument for each organism can also be found in the 'species' column of the [GL-DPPD-7110-A_annotations.csv](../../Pipeline_GL-DPPD-7110_Versions/GL-DPPD-7110-A/GL-DPPD-7110-A_annotations.csv)
145
-
146
-
-*Optional*: A local reference table CSV file can be supplied as a second positional argument. If not provided, the script will download the current version of the [GL-DPPD-7110-A_annotations.csv](../../Pipeline_GL-DPPD-7110_Versions/GL-DPPD-7110-A/GL-DPPD-7110-A_annotations.csv) table by default.
147
-
148
-
149
-
**Output data:**
150
-
151
-
- org.*.eg.db/ (Species-specific annotation database, as a local R package)
152
111
153
112
<br>
154
113
@@ -198,7 +157,7 @@ BiocManager::install("GO.db")
198
157
199
158
#### Step 2: Run the Workflow
200
159
201
-
While in the directory containing the `GL_RefAnnotTable-A_1.1.0` folder, you can now run the workflow. Below is an example of how to run the workflow to build an annotation table for *Mus musculus* (mouse):
160
+
While in the directory containing the `GL_RefAnnotTable-A_1.1.0` folder that was downloaded in [step 1](#1-download-the-workflow-files), you can now run the workflow. Below is an example of how to run the workflow to build an annotation table for *Mus musculus* (mouse):
202
161
203
162
204
163
```bash
@@ -207,9 +166,17 @@ Rscript GL_RefAnnotTable-A_1.1.0/GL-DPPD-7110-A_build-genome-annots-tab.R 'Mus m
207
166
208
167
<br>
209
168
169
+
---
170
+
171
+
### Workflow Input/Output Data
172
+
173
+
The input and output data are the same for both [Approach 1: Using Singularity](#approach-1-using-singularity) and [Approach 2: Using a Local R Environment](#approach-2-using-a-local-r-environment).
174
+
175
+
<br>
176
+
210
177
**Input data:**
211
178
212
-
- No input files are required. Specify the species name of the target organism using a positional command line argument. `Mus musculus` is used in the example above.
179
+
- No input files are required. Specify the species name of the target organism using a positional command line argument. `Mus musculus` is used in both the Singularity and the local R environment examples above.
213
180
> **Notes**:
214
181
> - To see a list of all available organisms, run `Rscript GL-DPPD-7110-A_build-genome-annots-tab.R` without positional arguments.
215
182
> - The correct argument for each organism can also be found in the 'species' column of the [GL-DPPD-7110-A_annotations.csv](../../Pipeline_GL-DPPD-7110_Versions/GL-DPPD-7110-A/GL-DPPD-7110-A_annotations.csv)
@@ -223,12 +190,27 @@ Rscript GL_RefAnnotTable-A_1.1.0/GL-DPPD-7110-A_build-genome-annots-tab.R 'Mus m
223
190
224
191
-*-GL-build-info.txt (Text file containing information used to create the annotation table, including tool and tool versions and date of creation)
225
192
226
-
<br>
193
+
<br>
227
194
228
-
#### *Optional*: Run the Annotations Database Creation Function as a Stand-Alone Script via R
195
+
---
196
+
197
+
### 3. Run the Annotations Database Creation Function as a Stand-Alone Script
198
+
199
+
If the reference table does not specify an annotations database for the target organism in the 'annotations' column of the [GL-DPPD-7110-A_annotations.csv](../../Pipeline_GL-DPPD-7110_Versions/GL-DPPD-7110-A/GL-DPPD-7110-A_annotations.csv) file, the `install_annotations` function (defined in `install-org-db.R`) will be executed by default. This function can also be run as a stand-alone script:
200
+
201
+
<br>
202
+
203
+
#### Using Singularity
229
204
230
-
If the reference table does not specify an annotations database for the target organism in the 'annotations' column of the [GL-DPPD-7110-A_annotations.csv](../../Pipeline_GL-DPPD-7110_Versions/GL-DPPD-7110-A/GL-DPPD-7110-A_annotations.csv) file, the `install_annotations` function (defined in `install-org-db.R`) will be executed by default. This function can also be run as a stand-alone script:
- The species name of the target organism must be specified as the first positional command line argument. `Bacillus subtilis` is used in the example above.
223
+
- The species name of the target organism must be specified as the first positional command line argument. `Bacillus subtilis` is used in both the Singularity and local R examples above.
242
224
> **Note**: The correct argument for each organism can also be found in the 'species' column of the [GL-DPPD-7110-A_annotations.csv](../../Pipeline_GL-DPPD-7110_Versions/GL-DPPD-7110-A/GL-DPPD-7110-A_annotations.csv)
243
225
244
226
-*Optional*: A local reference table CSV file can be supplied as a second positional argument. If not provided, the script will download the current version of the [GL-DPPD-7110-A_annotations.csv](../../Pipeline_GL-DPPD-7110_Versions/GL-DPPD-7110-A/GL-DPPD-7110-A_annotations.csv) table by default.
0 commit comments