SISRS v2.0: SNP Identification from Short Read Sequences

SISRS (pronounced “scissors”) is a program for identifying phylogenetically informative sites from next-generation whole-genome sequencing of multiple species. It identifies homologous sites without the need for de novo assembly, annotation, and alignment of individual genomes. Instead, it identifies conserved regions through joint de novo assembly of reads from multiple species. Sequencing reads are then aligned back to the resulting contigs to identify variable sites.
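To make "variable" and "informative" concrete, here is a toy sketch of how one aligned site could be classified from per-species base calls. It is not SISRS code. It assumes one consensus base per species at the site and uses the common parsimony-informative criterion (at least two different bases, each seen in at least two species) as one possible meaning of "phylogenetically informative". The function name and the `min_species` threshold are hypothetical.

```python
from collections import Counter

VALID_BASES = {"A", "C", "G", "T"}


def classify_site(bases_by_species, min_species=2):
    """Classify one aligned site from per-species consensus bases.

    bases_by_species maps a species name to a base call; anything other
    than A, C, G, or T (for example 'N') is treated as missing data.
    This is an illustrative assumption, not the filtering SISRS applies.
    """
    called = [b.upper() for b in bases_by_species.values() if b.upper() in VALID_BASES]
    if len(called) < min_species:
        return "insufficient data"

    counts = Counter(called)
    if len(counts) == 1:
        return "invariant"

    # Bases carried by at least two species can support a grouping.
    shared = [base for base, n in counts.items() if n >= 2]
    if len(shared) >= 2:
        return "parsimony-informative"
    return "variable (singleton)"


if __name__ == "__main__":
    site = {"sp1": "A", "sp2": "A", "sp3": "G", "sp4": "G", "sp5": "N"}
    print(classify_site(site))
```

A site where only one species differs is still variable, but under this criterion it does not help group species, which is why the sketch reports it separately.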

License

This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version. You may use and modify this software so long as you acknowledge its authors, list changes you have made, and continue to use the GPL3 license.

This program is distributed in the hope that it will be useful, but without any warranty; without even the implied warranty of merchantability or fitness for a particular purpose. See the GNU General Public License for more details.

Tutorial

For sample data, unzip SISRS_Small.zip.

On an HPC cluster that uses Slurm, run the scripts in the slurm_example_scripts folder in numerical order. When using your own data, ensure that the path to sisrs is correct in your scripts.

01 - set up folders
02 - trim reads
03 - subset the reads to a total of 10x coverage for the composite genome assembly
04 - do the composite genome assembly (with Ray)
05 - set up folders to output data for each species and index the composite genome
06 - align individual species to the composite genome
07 - get pileup for each species (i.e., info for each site)
08 - realign for more data per species
09 - get pileup for each species (i.e., info for each site)
10 - get nexus alignment of variable sites
11 - get consensus for each site and filter by allelic coverage and ratio of heterozygous sites
12 - align these contigs
13 - filter contigs based on distance from composite, length, and number of variable sites
14 - use BLAST to remove similar contigs and any contigs in which many species have high heterozygosity
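Because each step depends on the output of the one before it, one way to run the scripts in numerical order is to chain the Slurm submissions. The sketch below only prints the commands. It assumes the script file names begin with their step number, which may not match the actual names in slurm_example_scripts. The helper names and the `$PREV` shell placeholder are hypothetical; `sbatch --parsable` and `--dependency=afterok:<jobid>` are standard Slurm options.

```python
import os
import re


def order_by_numeric_prefix(names):
    """Keep names that start with digits and sort them by that leading number."""
    numbered = []
    for name in names:
        match = re.match(r"\d+", name)
        if match:
            numbered.append((int(match.group()), name))
    return [name for _, name in sorted(numbered)]


def chained_sbatch_commands(scripts):
    """Build one sbatch command per script, each waiting on the previous job.

    $PREV stands for the job ID printed by the previous `sbatch --parsable`
    call; capturing it is left to the calling shell.
    """
    commands = []
    for index, script in enumerate(scripts):
        parts = ["sbatch", "--parsable"]
        if index > 0:
            parts.append("--dependency=afterok:$PREV")
        parts.append(script)
        commands.append(" ".join(parts))
    return commands


if __name__ == "__main__":
    folder = "slurm_example_scripts"  # folder named in the tutorial
    if os.path.isdir(folder):
        for command in chained_sbatch_commands(order_by_numeric_prefix(os.listdir(folder))):
            print(command)
```

Printing rather than submitting keeps the sketch safe to run anywhere. Review the commands and each script's path to sisrs before submitting anything.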

Note: to pick up an analysis from the middle while moving to a new folder (i.e., leaving the original analysis intact):

run slurm script 01
run 02 adding --link to the sisrs command to copy over prior results
run 04 adding --link [prior run folder]
run 05 adding --link [prior run folder]
run 06 (array or not) adding --link [prior run folder]
run 10 adding --link [prior run folder]

Support and Communication

If you have any questions about the software, please reach out to us on our GitHub issues page at SISRS GitHub.

For other forms of communication, we invite you to visit our lab's website at Schwartz Lab.
