Skip to content

Commit dd27bde

Browse files
author
nfortelny
committed
x
1 parent 44f7fd7 commit dd27bde

File tree

1 file changed

+13
-2
lines changed

1 file changed

+13
-2
lines changed

Script_UNIX.md

Lines changed: 13 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -304,6 +304,17 @@ man wget
304304
wget http://ftp.ensembl.org/pub/release-103/fasta/homo_sapiens/cds/Homo_sapiens.GRCh38.cds.all.fa.gz
305305
```
306306

307+
If the above fails, then you can also copy the file from the resources folder into you HOME directory.
308+
``` bash
309+
cp /resources/week1/Homo_sapiens.GRCh38.cds.all.fa.gz ~/
310+
```
311+
312+
To make sure you have the entire file properly downloaded, compare the MD5 hash of the file. MD5 hash functions are a compact digital fingerprint of a file. The MD5 hash of the file should be "b16d46bf09c3b8b7909624f1e6c414ce".
313+
``` bash
314+
md5sum ~/Homo_sapiens.GRCh38.cds.all.fa.gz
315+
md5sum /resources/week1/Homo_sapiens.GRCh38.cds.all.fa.gz
316+
```
317+
307318
How large is this file?
308319
```bash
309320
du *
@@ -641,14 +652,14 @@ done < gRNAs.txt
641652

642653
#### Exercise
643654

644-
Now we will combine the above variables and loopes to test all guides in file gRNAs.txt against all DNA sequences in GRCh38_reformatted.gz. To do so:
655+
Now we will combine the above variables and loopes to test all guides in file gRNAs.txt against all DNA sequences in GRCh38_reformatted.gz. To do so you need the following steps (also see the hints below):
645656

646657
- Use the code above (under "variables") that matches guides against DNA sequences and extracts gene symbols for matched sequences.
647658
- Place this function into the loop that iterates through the file gRNAs.txt, using the while loop iterating through the file.
648659
- Store all results (not just the top 30 coming from head -30) of each guide into a file that is named: `results_${guide}.txt`. Place all files into a new folder "day4" - ideally already within the loop.
649660
- Check a few examples by hand. Do you get the right genes? Do you get the correct number of genes for the guides?
650661

651-
To complete the exercise, you can use the following code to extract genes for one guide. Note: The guide needs to be defined first using `guide=[...]`
662+
Hint: To complete the exercise, you can use the following code to extract genes for one guide. Note: The guide needs to be defined first using `guide=[...]`
652663
```bash
653664
echo $guide
654665
pattern="seq:[ACTG]*${guide}"

0 commit comments

Comments
 (0)