Skip to content

Commit 9322066

Browse files
committed
updated README.md
1 parent b30155f commit 9322066

File tree

2 files changed

+14
-13
lines changed

2 files changed

+14
-13
lines changed

README.md

Lines changed: 9 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -29,17 +29,22 @@ The mapped fragments are given to a script that filters ambiguously
2929
mapped fragments:
3030
```
3131
./filterAlnScoreAndQual.py -i mapped_smurf_reads.sam \
32-
-o unambig_smurf_frags.sam -s 120 -q 1
32+
-o unambig_smurf_frags.sam -s 120 -q 1
3333
```
3434

35+
The input file `mapped_smurf_reads.sam` is just the mapped reads
36+
(e.g. with BWA). The output file `unambig_smurf_frags.sam`
37+
contains mapped fragments with mapping quality greater than or equal
38+
to 1.
39+
3540
Then the remaining fragments are given to a script that obtains
3641
the counts of reads in bins:
3742
```
3843
./getBinCounts.py -i unambig_smurf_frags.sam -c hg19.chrom.sizes \
3944
-b bins_5k_hg19.bed -o bin_counts.bed -s bin_stats.txt
4045
```
41-
The input file `mapped_smurf_reads.sam` is just the mapped reads
42-
(e.g. with BWA). The file `hg19.chrom.size` is the size of all chroms
46+
The input file `unambig_smurf_frags.sam` is the same as described above.
47+
The file `hg19.chrom.size` is the size of all chroms
4348
in the reference genome. This file for the hg19 reference is supplied
4449
in the `data` directory in this repo, and was obtained from the UCSC
4550
Genome Browser's database. The pre-defined bins file `bins_5k_hg19.bed`
@@ -164,7 +169,7 @@ well it recovers the known mapping locations. The steps are as follows.
164169
`bedtools` (we used v2.26.0). We also require the `deadzones`
165170
program from `http://github.com/smithlabcode/utils` but this could
166171
be substituted for any means of excluding unmappable regions.
167-
In our CNV analysis, we used `bwa` (version...).
172+
In our CNV analysis, we used `bwa` (0.7.17).
168173

169174
## Contacts and Bug Reports
170175

test/README.md

Lines changed: 5 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -1,10 +1,6 @@
1-
## To test mapping a simulated SMURF-seq read to a
2-
## small simulated genome
3-
../map/smurfseq-map.sh map/ref_sim.fa map/read_sim.fastq
1+
## To test mapping simulated SMURF-seq reads to a small simulated genome
2+
A simulated reference genome `ref_sim.fa` has been indexed with BWA-MEM.
3+
`read_sim.fastq` contains simulated reads.
44

5-
## Generate CNV profile
6-
../cnv/cnv-5k.sh map/read_sim.fastq.2x.sam test_cnv
7-
8-
## To test mapping a simulated-read to the human
9-
## genome
10-
../map/smurfseq-map.sh [path to human genome] map/read_hg.fastq
5+
## To test mapping simulated reads to the human genome
6+
`read_hg.fastq` contains simulated reads from the human genome.

0 commit comments

Comments
 (0)