You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
-[5. Run the Annotations Database Creation Function as a Stand-Alone Script](#5-run-the-annotations-database-creation-function-as-a-stand-alone-script)
12
-
-[Approach 2: Using a Local R Environment](#approach-2-using-a-local-r-environment)
13
-
-[1. Install R and Required R Packages](#1-install-r-and-required-r-packages)
14
-
-[2. Download the Workflow Files](#2-download-the-workflow-files-1)
15
-
-[3. Set Execution Permissions for Workflow Scripts](#3-set-execution-permissions-for-workflow-scripts)
16
-
-[4. Run the Workflow](#4-run-the-workflow-1)
17
-
-[5. Run the Annotations Database Creation Function as a Stand-Alone Script](#5-run-the-annotations-database-creation-function-as-a-stand-alone-script-1)
18
-
19
-
<br>
7
+
-[1. Download the Workflow Files](#1-download-the-workflow-files)
8
+
-[2. Run the Workflow](#2-run-the-workflow)
9
+
-[Approach 1: Using Singularity](#approach-1-using-singularity)
-[Step 2: Fetch the Singularity Image](#step-2-fetch-the-singularity-image)
12
+
-[Step 3: Run the Workflow](#step-3-run-the-workflow)
13
+
-[Step 4: Run the Annotations Database Creation Function as a Stand-Alone Script](#step-4-run-the-annotations-database-creation-function-as-a-stand-alone-script)
14
+
-[Approach 2: Using a Local R Environment](#approach-2-using-a-local-r-environment)
15
+
-[Step 1: Install R and Required R Packages](#step-1-install-r-and-required-r-packages)
16
+
-[Step 2: Run the Workflow](#step-2-run-the-workflow)
17
+
-[Step 3: Run the Annotations Database Creation Function as a Stand-Alone Script](#step-3-run-the-annotations-database-creation-function-as-a-stand-alone-script)
20
18
21
19
---
22
20
23
-
## General Workflow Info
24
-
25
-
The current GeneLab Reference Annotation Table (GL_RefAnnotTable-A) pipeline is implemented as an R workflow that can be run from a command line interface (CLI) using bash. The workflow can be executed using either a Apptainer (formerly Singularity) container or a local R environment. The workflow can be used even if you are unfamiliar with R, but if you want to learn more about R, visit the [R-project about page here](https://www.r-project.org/about.html). Additionally, an introduction to R along with installation help and information about using R for bioinformatics can be found [here at Happy Belly Bioinformatics](https://astrobiomike.github.io/R/basics).
21
+
## General Workflow Information
26
22
27
-
<br>
23
+
The current GeneLab Reference Annotation Table (GL_RefAnnotTable-A) pipeline is implemented as an R workflow that can be run from a command line interface (CLI) using bash. The workflow can be executed using either a Singularity container or a local R environment. The workflow can be used even if you are unfamiliar with R, but if you want to learn more about R, visit the [R-project about page here](https://www.r-project.org/about.html). Additionally, an introduction to R along with installation help and information about using R for bioinformatics can be found [here at Happy Belly Bioinformatics](https://astrobiomike.github.io/R/basics).
28
24
29
25
---
30
26
31
27
## Utilizing the Workflow
32
28
33
-
The GL_RefAnnotTable-A workflow can be run using two approaches:
To utilize the GL_RefAnnotTable-A workflow, follow the instructions below to download the necessary workflow files. Once downloaded, the workflow can be executed using two approaches:
36
30
37
-
2.**[Using a local R environment](#approach-2-using-a-local-r-environment)**.
Apptainer can be installed either through [Anaconda](https://anaconda.org/conda-forge/singularity) or as documented on the [Apptainer documentation page](https://apptainer.org/docs/admin/main/installation.html).
49
+
### 2. Run the Workflow
56
50
57
-
> **Note**: If you prefer to use Anaconda, we recommend installing Miniconda for your system, as instructed by [Happy Belly Bioinformatics](https://astrobiomike.github.io/unix/conda-intro#getting-and-installing-conda).
58
-
>
59
-
> Once conda is installed on your system, you can install Apptainer by running:
60
-
>
61
-
> ```bash
62
-
> conda install -c conda-forge apptainer
63
-
>```
51
+
The GL_RefAnnotTable-A workflow can be run using two approaches:
64
52
65
-
<br>
53
+
-**[Approach 1: Using Singularity](#approach-1-using-singularity)**
54
+
-**[Approach 2: Using a Local R Environment](#approach-2-using-a-local-r-environment)**
66
55
67
56
---
68
57
69
-
#### 2. Download the Workflow Files
58
+
#### Approach 1: Using Singularity
70
59
71
-
Download the latest version of the GL_RefAnnotTable-A workflow:
60
+
This approach allows you to run the workflow within a containerized environment, ensuring consistency and reproducibility.
Singularity is a containerization platform for running applications portably and reproducibly. We use container images hosted on Quay.io to encapsulate all the necessary software and dependencies required by the GL_RefAnnotTable-A workflow. This setup allows you to run the workflow without installing any software directly on your system. Other containerization tools like Docker or Apptainer can also be used to pull and run these images.
80
65
81
-
---
66
+
We recommend installing Singularity system-wide as per the official [Singularity installation documentation](https://docs.sylabs.io/guides/3.10/admin-guide/admin_quickstart.html).
82
67
83
-
#### 3. Fetch Apptainer Image
68
+
> **Note**: While Singularity is also available through [Anaconda](https://anaconda.org/conda-forge/singularity), we recommend installing Singularity system-wide following the official installation documentation.
84
69
85
-
To fetch the Apptainer image needed for the workflow, run:
> Note: This command should be run in the directory containing the GL_RefAnnotTable-A_1.1.0 folder downloaded in [step 2](#2-download-the-workflow-files). Depending on your network speed, this may take approximately 20 minutes.
72
+
To pull the Singularity image needed for the workflow, you can use the provided script as directed below or pull the image directly.
91
73
92
-
Once complete, an apptainer folder containing the Apptainer image will be created. Export this folder as an Apptainer configuration environment variable:
74
+
> **Note**: This command should be run in the location containing the `GL_RefAnnotTable-A_1.1.0` directory that was downloaded in [step 1](#1-download-the-workflow-files). Depending on your network speed, fetching the images will take approximately 20 minutes.
Once complete, a `singularity` folder containing the Singularity images will be created. Run the following command to export this folder as an environment variable:
99
81
100
-
---
82
+
```bash
83
+
export SINGULARITY_CACHEDIR=$(pwd)/singularity
84
+
```
101
85
102
-
####4. Run the Workflow
86
+
##### Step 3: Run the Workflow
103
87
104
-
While in the `GL_RefAnnotTable-A_1.1.0`directory, you can now run the workflow. Below is an example for generating an annotation table for Mus musculus (mouse):
88
+
While in the directory containing the `GL_RefAnnotTable-A_1.1.0`folder, you can now run the workflow. Below is an example for generating the annotation table for *Mus musculus* (mouse):
-*-GL-annotations.tsv (Tab delineated table of gene annotations)
120
104
-*-GL-build-info.txt (Text file containing information used to create the annotation table, including tool and tool versions and date of creation)
121
105
122
-
<br>
123
-
124
-
---
106
+
##### Step 4: Run the Annotations Database Creation Function as a Stand-Alone Script
125
107
126
-
#### 5. Run the Annotations Database Creation Function as a Stand-Alone Script
127
-
128
-
If the reference table does not specify an annotations database for the target organism in the annotations column, the `install_annotations` function (defined in `install-org-db.R`) will be executed. This function can also be run as a stand-alone script:
108
+
If the reference table does not specify an annotations database for the target organism in the 'annotations' column, the `install_annotations` function (defined in `install-org-db.R`) will be executed. This function can also be run as a stand-alone script:
All files required for utilizing the GL_RefAnnotTable-A workflow for generating reference annotation tables are in the [workflow_code](workflow_code) directory. To get a copy of latest GL_RefAnnotTable version on to your system, run the following command:
162
+
While in the directory containing the `GL_RefAnnotTable-A_1.1.0` folder, you can now run the workflow. Below is an example of how to run the workflow to build an annotation table for *Mus musculus* (mouse):
#### 3. Set Execution Permissions for Workflow Scripts
203
-
204
-
Once you've downloaded the GL_RefAnnotTable-A workflow directory as a zip file, unzip the workflow then `cd` into the GL_RefAnnotTable-A_1.1.0 directory on the CLI. Next, run the following command to set the execution permissions for the R script:
205
-
206
-
```bash
207
-
unzip GL_RefAnnotTable-A_1.1.0.zip
208
-
cd GL_RefAnnotTable-A_1.1.0
209
-
chmod -R u+x *R
210
-
```
211
-
212
-
<br>
213
-
214
-
---
215
-
216
-
#### 4. Run the Workflow
217
-
218
-
While in the GL_RefAnnotTable workflow directory, you are now able to run the workflow. Below is an example of how to run the workflow to build an annotation table for Mus musculus (mouse):
-*-GL-annotations.tsv (Tab delineated table of gene annotations)
232
176
-*-GL-build-info.txt (Text file containing information used to create the annotation table, including tool and tool versions and date of creation)
233
177
234
-
<br>
235
-
236
-
---
237
-
238
-
#### 5. Run the Annotations Database Creation Function as a Stand-Alone Script
178
+
##### Step 3: Run the Annotations Database Creation Function as a Stand-Alone Script
239
179
240
180
If the reference table does not specify an annotations database for the target organism in the 'annotations' column, the `install_annotations` function (defined in `install-org-db.R`) will be executed. This function can also be run as a stand-alone script:
Copy file name to clipboardExpand all lines: GeneLab_Reference_Annotations/Workflow_Documentation/GL_RefAnnotTable-A/workflow_code/bin/prepull_singularity.sh
0 commit comments