Skip to content

prwoolley/IgG-indel-detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 

Repository files navigation

IgG-indel-detection

Indel detection software designed to be used on MiXCR outputs

Inputs are two files: Assembled-annotation and Proteomic-database

The pipeline is outlined as follows:

  1. Creates directories output/fastas, output/mmseqs, output/igblast, where "output" is a user defined directory.
  2. Pull cluster rep sequences from Proteomic-database and write to FASTA file.
  3. Pull sequences from Assembled-annotation and write to FASTA file.
  4. Create MMseqs database for the Assembled-annotation FASTA.
  5. Create MMseqs database for the Proteomic-database FASTA.
  6. Search Assembled-annotation database against Proteomic-database. Write alignments to m8 file.
  7. Load m8 file and original Proteomic-databases. For each cluster, compute the number of peaks along the sequence length axis. Flag any clusters of a certain cluster size threshold that had an increase in the number of peaks. Open the Assembled-annotation file. Write any sequence from the flagged clusters to an abbreviated Assembled-annotation file and FASTA file.
  8. Run IgBLAST on the abbreviated Assembled-annotation FASTA file.
  9. Run insertion_finder.py on IgBLAST MSAs.

About

Indel detection software designed to be used on MiXCR outputs

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •