Skip to content

homopolymer indel weighting #11

@nikosdarzentas

Description

@nikosdarzentas

Hi Nick,

This is my 3rd report and the most important so far. It's about homopolymers, on Version 2.6.5 (all the issues I opened are on this version).

In the example below I would expect the homopolymer-based indel (two extra 't' in 1st sequence - and I guess the minimum homopolymer length is 3?) to be called (i.e. sequences clustered), because my understanding is that it should be weighted down to below 1, in particular 1/((5+3)/2) = 1/4 = 0.25 based on the manuscript - or 0.5 if you're counting this as 2 events?

Any insights?

stopCheck:smallCutoff:1baseIndel:2baseIndel:>2baseIndel:HQMismatches:LQMismatches:LKMismatches
9999999:0:1:0:0:0:0:0

@1;size=1
acgtttttACGTacgt
+
********HHHH****
@2;size=1
acgtttACGTacgt
+
******HHHH****

SeekDeep qluster --fastq qlutest.fastq --out qlutest --par myPar --noMarkChimeras --lower keep --caseInsensitive --smallReadSize 0 --useAllInput --writeOutInitalSeqs --overWrite --verbose --fastClustering --nucCutOff 0.2 --runCutOff 0%,0 --adjustHomopolyerRuns false --qualThresWindow 0

Thank you!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions