Skip to content

Commit f54a529

Browse files
Formatting updates
1 parent 75dd660 commit f54a529

File tree

1 file changed

+37
-53
lines changed
  • GeneLab_Reference_Annotations/Workflow_Documentation/GL_RefAnnotTable-A

1 file changed

+37
-53
lines changed

GeneLab_Reference_Annotations/Workflow_Documentation/GL_RefAnnotTable-A/README.md

Lines changed: 37 additions & 53 deletions
Original file line numberDiff line numberDiff line change
@@ -10,11 +10,13 @@
1010
- [Step 1: Install Singularity](#step-1-install-singularity)
1111
- [Step 2: Fetch the Singularity Image](#step-2-fetch-the-singularity-image)
1212
- [Step 3: Run the Workflow](#step-3-run-the-workflow)
13-
- [Optional: Run the Annotations Database Creation Function as a Stand-Alone Script via Singularity](#optional-run-the-annotations-database-creation-function-as-a-stand-alone-script-via-singularity)
1413
- [Approach 2: Using a Local R Environment](#approach-2-using-a-local-r-environment)
1514
- [Step 1: Install R and Required R Packages](#step-1-install-r-and-required-r-packages)
1615
- [Step 2: Run the Workflow](#step-2-run-the-workflow)
17-
- [Optional: Run the Annotations Database Creation Function as a Stand-Alone Script via R](#optional-run-the-annotations-database-creation-function-as-a-stand-alone-script-via-r)
16+
- [Workflow Input/Output Data](#workflow-input-output-data)
17+
- [3. Run the Annotations Database Creation Function as a Stand-Alone Script](#3-run-the-annotations-database-creation-function-as-a-stand-alone-script)
18+
- [Using Singularity](#using-singularity)
19+
- [Using a Local R Environment](#using-a-local-r-environment)
1820

1921
<br>
2022

@@ -93,62 +95,19 @@ Once complete, a `singularity` folder containing the Singularity images will be
9395
```bash
9496
export SINGULARITY_CACHEDIR=$(pwd)/singularity
9597
```
98+
9699
<br>
97100

98101
#### Step 3: Run the Workflow
99102

100-
While in the directory containing the `GL_RefAnnotTable-A_1.1.0` folder, you can now run the workflow. Below is an example for generating the annotation table for *Mus musculus* (mouse):
103+
While in the directory containing the `GL_RefAnnotTable-A_1.1.0` folder that was downloaded in [step 1](#1-download-the-workflow-files), you can now run the workflow. Below is an example for generating the annotation table for *Mus musculus* (mouse):
101104

102105

103106
```bash
104107
singularity exec -B $(pwd)/GL_RefAnnotTable-A_1.1.0:/work \
105108
$SINGULARITY_CACHEDIR/quay.io-nasa_genelab-gl-refannottable-a-1.1.0.img \
106109
Rscript /work/GL-DPPD-7110-A_build-genome-annots-tab.R 'Mus musculus'
107110
```
108-
<br>
109-
110-
**Input data:**
111-
112-
- No input files are required. Specify the species name of the target organism using a positional command line argument. `Mus musculus` is used in the example above.
113-
> **Notes**:
114-
> - To see a list of all available organisms, run `Rscript GL-DPPD-7110-A_build-genome-annots-tab.R` without positional arguments.
115-
> - The correct argument for each organism can also be found in the 'species' column of the [GL-DPPD-7110-A_annotations.csv](../../Pipeline_GL-DPPD-7110_Versions/GL-DPPD-7110-A/GL-DPPD-7110-A_annotations.csv)
116-
117-
- *Optional*: A local reference table CSV file can be supplied as a second positional argument. If not provided, the script will download the current version of the [GL-DPPD-7110-A_annotations.csv](../../Pipeline_GL-DPPD-7110_Versions/GL-DPPD-7110-A/GL-DPPD-7110-A_annotations.csv) table by default.
118-
119-
120-
**Output data:**
121-
122-
- *-GL-annotations.tsv (Tab delineated table of gene annotations)
123-
124-
- *-GL-build-info.txt (Text file containing information used to create the annotation table, including tool and tool versions and date of creation)
125-
126-
<br>
127-
128-
#### *Optional*: Run the Annotations Database Creation Function as a Stand-Alone Script via Singularity
129-
130-
If the reference table does not specify an annotations database for the target organism in the 'annotations' column of the [GL-DPPD-7110-A_annotations.csv](../../Pipeline_GL-DPPD-7110_Versions/GL-DPPD-7110-A/GL-DPPD-7110-A_annotations.csv) file, the `install_annotations` function (defined in `install-org-db.R`) will be executed by default. This function can also be run as a stand-alone script:
131-
132-
133-
```bash
134-
singularity exec -B $(pwd)/GL_RefAnnotTable-A_1.1.0:/work \
135-
$SINGULARITY_CACHEDIR/quay.io-nasa_genelab-gl-refannottable-a-1.1.0.img \
136-
Rscript /work/install-org-db.R 'Bacillus subtilis'
137-
```
138-
139-
<br>
140-
141-
**Input data:**
142-
143-
- The species name of the target organism must be specified as the first positional command line argument. `Bacillus subtilis` is used in the example above.
144-
> **Note**: The correct argument for each organism can also be found in the 'species' column of the [GL-DPPD-7110-A_annotations.csv](../../Pipeline_GL-DPPD-7110_Versions/GL-DPPD-7110-A/GL-DPPD-7110-A_annotations.csv)
145-
146-
- *Optional*: A local reference table CSV file can be supplied as a second positional argument. If not provided, the script will download the current version of the [GL-DPPD-7110-A_annotations.csv](../../Pipeline_GL-DPPD-7110_Versions/GL-DPPD-7110-A/GL-DPPD-7110-A_annotations.csv) table by default.
147-
148-
149-
**Output data:**
150-
151-
- org.*.eg.db/ (Species-specific annotation database, as a local R package)
152111

153112
<br>
154113

@@ -198,7 +157,7 @@ BiocManager::install("GO.db")
198157

199158
#### Step 2: Run the Workflow
200159

201-
While in the directory containing the `GL_RefAnnotTable-A_1.1.0` folder, you can now run the workflow. Below is an example of how to run the workflow to build an annotation table for *Mus musculus* (mouse):
160+
While in the directory containing the `GL_RefAnnotTable-A_1.1.0` folder that was downloaded in [step 1](#1-download-the-workflow-files), you can now run the workflow. Below is an example of how to run the workflow to build an annotation table for *Mus musculus* (mouse):
202161

203162

204163
```bash
@@ -207,9 +166,17 @@ Rscript GL_RefAnnotTable-A_1.1.0/GL-DPPD-7110-A_build-genome-annots-tab.R 'Mus m
207166

208167
<br>
209168

169+
---
170+
171+
### Workflow Input/Output Data
172+
173+
The input and output data are the same for both [Approach 1: Using Singularity](#approach-1-using-singularity) and [Approach 2: Using a Local R Environment](#approach-2-using-a-local-r-environment).
174+
175+
<br>
176+
210177
**Input data:**
211178

212-
- No input files are required. Specify the species name of the target organism using a positional command line argument. `Mus musculus` is used in the example above.
179+
- No input files are required. Specify the species name of the target organism using a positional command line argument. `Mus musculus` is used in both the Singularity and the local R environment examples above.
213180
> **Notes**:
214181
> - To see a list of all available organisms, run `Rscript GL-DPPD-7110-A_build-genome-annots-tab.R` without positional arguments.
215182
> - The correct argument for each organism can also be found in the 'species' column of the [GL-DPPD-7110-A_annotations.csv](../../Pipeline_GL-DPPD-7110_Versions/GL-DPPD-7110-A/GL-DPPD-7110-A_annotations.csv)
@@ -223,12 +190,27 @@ Rscript GL_RefAnnotTable-A_1.1.0/GL-DPPD-7110-A_build-genome-annots-tab.R 'Mus m
223190

224191
- *-GL-build-info.txt (Text file containing information used to create the annotation table, including tool and tool versions and date of creation)
225192

226-
<br>
193+
<br>
227194

228-
#### *Optional*: Run the Annotations Database Creation Function as a Stand-Alone Script via R
195+
---
196+
197+
### 3. Run the Annotations Database Creation Function as a Stand-Alone Script
198+
199+
If the reference table does not specify an annotations database for the target organism in the 'annotations' column of the [GL-DPPD-7110-A_annotations.csv](../../Pipeline_GL-DPPD-7110_Versions/GL-DPPD-7110-A/GL-DPPD-7110-A_annotations.csv) file, the `install_annotations` function (defined in `install-org-db.R`) will be executed by default. This function can also be run as a stand-alone script:
200+
201+
<br>
202+
203+
#### Using Singularity
229204

230-
If the reference table does not specify an annotations database for the target organism in the 'annotations' column of the [GL-DPPD-7110-A_annotations.csv](../../Pipeline_GL-DPPD-7110_Versions/GL-DPPD-7110-A/GL-DPPD-7110-A_annotations.csv) file, the `install_annotations` function (defined in `install-org-db.R`) will be executed by default. This function can also be run as a stand-alone script:
205+
```bash
206+
singularity exec -B $(pwd)/GL_RefAnnotTable-A_1.1.0:/work \
207+
$SINGULARITY_CACHEDIR/quay.io-nasa_genelab-gl-refannottable-a-1.1.0.img \
208+
Rscript /work/install-org-db.R 'Bacillus subtilis'
209+
```
231210

211+
<br>
212+
213+
#### Using a Local R Environment
232214

233215
```bash
234216
Rscript GL_RefAnnotTable-A_1.1.0/install-org-db.R 'Bacillus subtilis'
@@ -238,7 +220,7 @@ Rscript GL_RefAnnotTable-A_1.1.0/install-org-db.R 'Bacillus subtilis'
238220

239221
**Input data:**
240222

241-
- The species name of the target organism must be specified as the first positional command line argument. `Bacillus subtilis` is used in the example above.
223+
- The species name of the target organism must be specified as the first positional command line argument. `Bacillus subtilis` is used in both the Singularity and local R examples above.
242224
> **Note**: The correct argument for each organism can also be found in the 'species' column of the [GL-DPPD-7110-A_annotations.csv](../../Pipeline_GL-DPPD-7110_Versions/GL-DPPD-7110-A/GL-DPPD-7110-A_annotations.csv)
243225
244226
- *Optional*: A local reference table CSV file can be supplied as a second positional argument. If not provided, the script will download the current version of the [GL-DPPD-7110-A_annotations.csv](../../Pipeline_GL-DPPD-7110_Versions/GL-DPPD-7110-A/GL-DPPD-7110-A_annotations.csv) table by default.
@@ -248,4 +230,6 @@ Rscript GL_RefAnnotTable-A_1.1.0/install-org-db.R 'Bacillus subtilis'
248230

249231
- org.*.eg.db/ (Species-specific annotation database, as a local R package)
250232

233+
<br>
234+
251235
---

0 commit comments

Comments
 (0)