Skip to content

Mismatch between segment sequence and contig sequence #1492

@Martin-lc

Description

@Martin-lc

Description of bug

From contigs.path, I have:
NODE_10581_length_541_cov_2.942953
503304+

From contigs.fasta, I have:

NODE_10581_length_541_cov_2.942953
TAATATGAGTACCGCTCTTTATTTTGTAGTAAACGATGCTTTTTATCGTAAGGCTTTGCC
GAAGGCACAATTGCCGGAAGGAGTGTTGCTTGGCAGCTTGAAAGAGCTGTCTGAACAATA
TCCGGCTTTGGTCAAGCAGTATTATGGCAAGTTGGCAGATACTTCCAAGGATGGGGTGAC
CGCCTTCAATAATACTTTTGCCCAGGATGGCTTTATGCTGTATGTGCCGAAAGGCGTGGT
GGTGGACAAACCCATTCAACTGGTGAACATATTGCGTGCTGATGTTAATTTTATGGTGAA
CCGCCGTGTGCTGGTTGTGCTGGAAGAAGGTGCGCAGGCTCGTCTGTTGATTTGTGATCA
TGCCATGGATAATGTAAATTTCCTTTCTACTCAGGTTATTGAGGTCTTTGCAAAAGAAAA
TGCTACTTTCGATCTTTATGAACTGGAAGAAACCCATACCAGCACAGTGCGTTTCAGTAA
CCTCTATGTGAACCAGGAGGCAGACAGTAATGTGCTTTTGAATGGTATGACTTTGCATAA
C

From assembly_graph_after_simplification.gfa, I have:
S 503304 TGATCTCAGCTCCACGTCCGGCCAAAGTTACTTCAGTTGTATTACGCGTAGTACCTAATATGAGTACCGCTCTTTATTTTGTAGTAAACGATGCTTTTTATCGTAAGGCTTTGCCGAAGGCACAATTGCCGGAAGGAGTGTTGCTTGGCAGCTTGAAAGAGCTGTCTGAACAATATCCGGCTTTGGTCAAGCAGTATTATGGCAAGTTGGCAGATACTTCCAAGGATGGGGTGACCGCCTTCAATAATACTTTTGCCCAGGATGGCTTTATGCTGTATGTGCCGAAAGGCGTGGTGGTGGACAAACCCATTCAACTGGTGAACATATTGCGTGCTGATGTTAATTTTATGGTGAACCGCCGTGTGCTGGTTGTGCTGGAAGAAGGTGCGCAGGCTCGTCTGTTGATTTGTGATCATGCCATGGATAATGTAAATTTCCTTTCTACTCAGGTTATTGAGGTCTTTGCAAAAGAAAATGCTACTTTCGATCTTTATGAACTGGAAGAAACCCATACCAGCACAGTGCGTTTCAGTAACCTCTATGTGAACCAGGAGGCAGACAGTAATGTGCTTTTGAATGGTATGACTTTGCATAACGGTACTACGCGTAATACAACTGAAGTAACTTTGGCCGGACGTGGAGCTGAGATCA DP:f:2.94295 KC:i:1754

As you can see, the first 55 bp of segment 503304 is omited from the contig 10581 which is solely made from the segment 503304.

Is this a bug? This can be a significant issue for some files, spanning for ~2% of the contigs (>500bps) produced.

spades.log

spades.log

params.txt

params.txt

SPAdes version

SPAdes v4.0.0

Operating System

Red Hat Enterprise Linux 9.0

Python Version

Python 3.13.0

Method of SPAdes installation

conda

No errors reported in spades.log

  • Yes

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions