contig_to_linear_genome

OVERVIEW

These scripts are used in the process of converting a complete and circular contig to a genome whose annotation starts with dnaA.

Scripts are specifically used to:

trim_fasta.py: Convert a complete and circular contig with overlap to a linear sequence.
rearrange_genome_dnaA.py: Rearrange a linear genome sequence to begin with dnaA as the first CDS annotated

NOTE

These scripts do NOT run annotation software. They are for those fiddly steps in between running annotation software. Running these scripts requires Python.

STEP-BY-STEP PROCESS SCRIPTS SHOULD BE USED IN

Obtain .fasta file containing contig.
Determine that contig is circular and at which bp location you want to trim the sequence. Recommend using Gepard for this.
Trim the sequence by running trim_fasta.py.
Run your favorite annotation program that generates a .gbk file.
Confirm that dnaA is NOT the first CDS in the annotation.
Determine how many bp upstream of dnaA should be the start of the rearranged genome.
Rearrange the genome by running rearrange_genome_dnaA.py.
Re-run your favorite annotation program.

COMMANDS TO RUN SCRIPTS

Both scripts MUST take in two arguments as shown below:

python trim_fasta.py [.fasta file] [integer]

First argument should be a (.fasta) file containing the complete circular contig. File should have only one fasta sequence in it. Second argument should be an integer that is the bp location the sequence should be trimmed at to make the linear representation of the genome.

python rearrange_genome_dnaA.py [.gbk file] [integer]

First argument is a genbank (.gbk) file from your first annotation run that does not start with the dnaA gene as the first CDS. Second argument should be an integer that is the number of bp upstream from the dnaA start position that you want in the rearranged representation of the genome. This will be where the rearranged linear representation of the genome starts.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
README.md		README.md
rearrange_genome_dnaA.py		rearrange_genome_dnaA.py
trim_fasta.py		trim_fasta.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

contig_to_linear_genome

OVERVIEW

NOTE

STEP-BY-STEP PROCESS SCRIPTS SHOULD BE USED IN

COMMANDS TO RUN SCRIPTS

About

Uh oh!

Releases

Packages

Languages

MarescaLab/contig_to_linear_genome

Folders and files

Latest commit

History

Repository files navigation

contig_to_linear_genome

OVERVIEW

NOTE

STEP-BY-STEP PROCESS SCRIPTS SHOULD BE USED IN

COMMANDS TO RUN SCRIPTS

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages