Skip to content

Commit 73f1f39

Browse files
committed
update documentation
1 parent f1e6fe3 commit 73f1f39

File tree

5 files changed

+42
-44
lines changed

5 files changed

+42
-44
lines changed

doc/SNPsea_manual.html

Lines changed: 18 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -619,11 +619,11 @@ <h1 id="introduction"><a href="#introduction">Introduction</a></h1>
619619
<p>We assume that a gene's specificity to a given cell type or condition is a reasonable indicator of the gene's importance to its function.</p>
620620
<p>If you benefit from this method, please cite:</p>
621621
<blockquote>
622-
<p>Slowikowski, K. et al. SNPsea: an algorithm to identify cell types, tissues, and pathways affected by risk loci &gt; Manuscript in progress.</p>
622+
<p>Slowikowski, K. et al. <strong>SNPsea: an algorithm to identify cell types, tissues, and pathways affected by risk loci.</strong> Manuscript in progress.</p>
623623
</blockquote>
624624
<p>See additional examples:</p>
625625
<blockquote>
626-
<p>Hu, X. et al. Integrating autoimmune risk loci with gene-expression data identifies specific pathogenic immune cell subsets. The American Journal of Human Genetics 89, 496–506 (2011). <a href="http://www.ncbi.nlm.nih.gov/pubmed/21963258">PubMed</a></p>
626+
<p>Hu, X. et al. <strong>Integrating autoimmune risk loci with gene-expression data identifies specific pathogenic immune cell subsets.</strong> The American Journal of Human Genetics 89, 496–506 (2011). <a href="http://www.ncbi.nlm.nih.gov/pubmed/21963258">PubMed</a></p>
627627
</blockquote>
628628
<h2 id="contact"><a href="#contact">Contact</a></h2>
629629
<p>Please contact me with questions and comments: <script type="text/javascript">
@@ -842,7 +842,7 @@ <h1 id="usage"><a href="#usage">Usage</a></h1>
842842
<h2 id="example"><a href="#example">Example</a></h2>
843843
<p>Here is a <a href="http://www.gnu.org/software/bash/manual/bashref.html">Bash</a> script with a usage example:</p>
844844
<pre class="sourceCode bash"><code class="sourceCode bash"><span class="ot">options=(</span>
845-
<span class="kw">--snps</span> LDL_Teslovich2010.txt
845+
<span class="kw">--snps</span> Red_blood_cell_count-Harst2012-45_SNPs.gwas
846846
<span class="kw">--gene-matrix</span> GeneAtlas2004.gct.gz
847847
<span class="kw">--gene-intervals</span> NCBIgenes2013.bed.gz
848848
<span class="kw">--snp-intervals</span> TGP2011.bed.gz
@@ -851,11 +851,11 @@ <h2 id="example"><a href="#example">Example</a></h2>
851851
<span class="kw">--slop</span> 10e3
852852
<span class="kw">--threads</span> 4
853853
<span class="kw">--null-snpsets</span> 0
854-
<span class="kw">--min-observations</span> 50
855-
<span class="kw">--max-iterations</span> 1e6
854+
<span class="kw">--min-observations</span> 100
855+
<span class="kw">--max-iterations</span> 1e7
856856
)
857857
<span class="kw">snpsea</span> <span class="ot">${options[*]}</span></code></pre>
858-
<p>This will run the analysis on SNPs associated with LDL cholesterol and test for tissue-specific expression of the nearby genes across 79 human tissues in the Gene Atlas gene expression matrix. Additionally, 1000 null random matched SNP sets will be tested and their results will also be recorded. Each tissue will be tested up to 1 million times, or testing will stop for a tissue if 50 matched SNP sets are observed to achieve a higher specificity score than the user's SNPs.</p>
858+
<p>SNPs will test SNPs associated with Red blood cell count for tissue-specific expression of linked genes across 79 human tissues in the Gene Atlas expression matrix. Each tissue will be tested up to 10 million times with matched random SNP sets, or testing will stop for a tissue if 100 matched SNP sets achieve a higher specificity score than the user's SNPs.</p>
859859
<h2 id="options"><a href="#options">Options</a></h2>
860860
<p>All input files may optionally be compressed with <a href="http://www.gzip.org/"><code>gzip</code></a>.</p>
861861
<h3 id="required"><a href="#required">Required</a></h3>
@@ -917,18 +917,18 @@ <h3 id="optional"><a href="#optional">Optional</a></h3>
917917
<h2 id="input-file-formats"><a href="#input-file-formats">Input File Formats</a></h2>
918918
<h3 id="snps-arg"><a href="#snps-arg"><code>--snps ARG</code></a></h3>
919919
<p>You must provide one or more comma-separated text files. SNP identifiers must be listed one per line. Only the first column is used.</p>
920-
<pre><code>head LDL_Teslovich2010.txt
921-
922-
rs11136341 chr8 145043543
923-
rs3757354 chr6 16127407
924-
rs12027135 chr1 25775733
925-
rs217386 chr7 44600695
926-
rs1169288 chr12 121416650
927-
rs7225700 chr17 45391804
928-
rs2479409 chr1 55504650
929-
rs247616 chr16 56989590
930-
rs2954022 chr8 126482621
931-
rs1564348 chr6 160578860</code></pre>
920+
<pre><code>head Red_blood_cell_count-Harst2012-45_SNPs.gwas
921+
922+
# Harst et al. 2012
923+
# doi:10.1038/nature11677
924+
# PMID: 23222517
925+
# 45 SNPs associated with red blood cell count (RBC) taken from Table 1.
926+
# Positions are on hg19. SNPs are included if P &lt;= 5e-8.
927+
CHR POS SNP P
928+
chr1 40069939 rs3916164 3e-10
929+
chr1 158575729 rs857684 4e-16
930+
chr1 199007208 rs7529925 8e-09
931+
chr1 248039451 rs3811444 5e-10</code></pre>
932932
<p>Instead of providing a file with SNPs, you may use &quot;randomN&quot; like this:</p>
933933
<pre><code>--snps random20</code></pre>
934934
<p>to sample 20 random SNPs from the <strong><code>--snp-intervals</code></strong> file.</p>

doc/SNPsea_manual.md

Lines changed: 23 additions & 25 deletions
Original file line numberDiff line numberDiff line change
@@ -37,13 +37,13 @@ a reasonable indicator of the gene's importance to its function.
3737

3838
If you benefit from this method, please cite:
3939

40-
> Slowikowski, K. et al. SNPsea: an algorithm to identify cell types, tissues,
41-
> and pathways affected by risk loci > Manuscript in progress.
40+
> Slowikowski, K. et al. **SNPsea: an algorithm to identify cell types,
41+
> tissues, and pathways affected by risk loci.** Manuscript in progress.
4242
4343
See additional examples:
4444

45-
> Hu, X. et al. Integrating autoimmune risk loci with gene-expression data
46-
> identifies specific pathogenic immune cell subsets. The American Journal
45+
> Hu, X. et al. **Integrating autoimmune risk loci with gene-expression data
46+
> identifies specific pathogenic immune cell subsets.** The American Journal
4747
> of Human Genetics 89, 496–506 (2011). [PubMed][Hu2011]
4848
4949

@@ -442,7 +442,7 @@ Here is a [Bash] script with a usage example:
442442

443443
```bash
444444
options=(
445-
--snps LDL_Teslovich2010.txt
445+
--snps Red_blood_cell_count-Harst2012-45_SNPs.gwas
446446
--gene-matrix GeneAtlas2004.gct.gz
447447
--gene-intervals NCBIgenes2013.bed.gz
448448
--snp-intervals TGP2011.bed.gz
@@ -451,19 +451,17 @@ options=(
451451
--slop 10e3
452452
--threads 4
453453
--null-snpsets 0
454-
--min-observations 50
455-
--max-iterations 1e6
454+
--min-observations 100
455+
--max-iterations 1e7
456456
)
457457
snpsea ${options[*]}
458458
```
459459

460-
This will run the analysis on SNPs associated with LDL cholesterol and
461-
test for tissue-specific expression of the nearby genes across 79 human
462-
tissues in the Gene Atlas gene expression matrix. Additionally, 1000 null
463-
random matched SNP sets will be tested and their results will also be
464-
recorded. Each tissue will be tested up to 1 million times, or testing will
465-
stop for a tissue if 50 matched SNP sets are observed to achieve a higher
466-
specificity score than the user's SNPs.
460+
SNPs will test SNPs associated with Red blood cell count for tissue-specific
461+
expression of linked genes across 79 human tissues in the Gene Atlas expression
462+
matrix. Each tissue will be tested up to 10 million times with matched random
463+
SNP sets, or testing will stop for a tissue if 100 matched SNP sets achieve a
464+
higher specificity score than the user's SNPs.
467465

468466

469467

@@ -551,18 +549,18 @@ You must provide one or more comma-separated text files. SNP identifiers must
551549
be listed one per line. Only the first column is used.
552550

553551
```
554-
head LDL_Teslovich2010.txt
552+
head Red_blood_cell_count-Harst2012-45_SNPs.gwas
555553
556-
rs11136341 chr8 145043543
557-
rs3757354 chr6 16127407
558-
rs12027135 chr1 25775733
559-
rs217386 chr7 44600695
560-
rs1169288 chr12 121416650
561-
rs7225700 chr17 45391804
562-
rs2479409 chr1 55504650
563-
rs247616 chr16 56989590
564-
rs2954022 chr8 126482621
565-
rs1564348 chr6 160578860
554+
# Harst et al. 2012
555+
# doi:10.1038/nature11677
556+
# PMID: 23222517
557+
# 45 SNPs associated with red blood cell count (RBC) taken from Table 1.
558+
# Positions are on hg19. SNPs are included if P <= 5e-8.
559+
CHR POS SNP P
560+
chr1 40069939 rs3916164 3e-10
561+
chr1 158575729 rs857684 4e-16
562+
chr1 199007208 rs7529925 8e-09
563+
chr1 248039451 rs3811444 5e-10
566564
```
567565

568566
Instead of providing a file with SNPs, you may use "randomN" like this:

doc/SNPsea_manual.pdf

558 Bytes
Binary file not shown.

doc/SNPsea_supplement.lyx

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1187,7 +1187,7 @@ snpsea
11871187

11881188
\family typewriter
11891189
\size small
1190-
--snps LDL_Teslovich2010.txt
1190+
--snps HDL_Teslovich2010.txt
11911191
\end_layout
11921192

11931193
\begin_layout Plain Layout
Loading

0 commit comments

Comments
 (0)