Skip to content

Improving performance and question about the output files from the specificity command #38

@charlesfoster

Description

@charlesfoster

Hello,

Thank you for this tool.

I generated a tiling scheme using:

  • an input MSA with hundreds of genomes from NCBI for my virus of interest
  • a BLAST database of non-specific sequences with many genomes of common circulating viruses + the human genome

In order, as directed in the README.md I ran 1) olivar build, 2) olivar tiling, 3) olivar specificity. The output naively looked good, i.e. ~96% coverage of my MSA of interest and the olivar-specificity_pool-1_ns-amp.csv and olivar-specificity_pool-2_ns-amp.csv files were empty. Accordingly, we ordered the primers to test on some clinical cases, but the results weren't good, e.g. a maximum of ~45% coverage even on samples with a very high viral load (as confirmed with qCR) and amplification of human sequences.

I'm now trying to figure out why the primers didn't work well in the real world as opposed to in silico. One thing I noticed was that despite the *_ns-amp.csv files being empty, the olivar-specificity_pool-*.csv files have some to many non-zero values in the 'BLAST_hits' column.

Q) Is this a red flag or is it normal?
Q) Am I correct that these represent blast hits against the non-specific sequences, or are they actually hits against the target MSA? If the latter, is it instead bad to see a value of zero for some rows?

Otherwise, do you have any other general tips for a potential re-design? For example, might I have used too many input sequences in the MSA? I thought it seemed like it still worked OK because of the good 96% coverage of the reference MSA in the logfile.

Exact commands used:

makeblastdb -in non_target.fasta -dbtype nucl -title non_target -parse_seqids -hash_index -out non_target  -logfile makeblastdb.out
mafft --auto --preservecase --reorder --adjustdirection --thread 8 target_virus.no_ambig.fasta > target_virus.no_ambig.aligned.fasta # followed by manual verification
olivar build --msa target_virus.no_ambig.aligned.fasta --db non_target --output olivar_target --title olivar_target --threads 12
olivar tiling olivar_target/olivar_target.olvr -o olivar_target --max-amp-len 1200 --min-amp-len 800 --check-var --seed 10 -p 12
olivar specificity olivar_target/olivar-design.csv --max-amp-len 1500 --pool 1 -d non_target -o olivar_target/ -p12
olivar specificity olivar_target/olivar-design.csv --max-amp-len 1500 --pool 2 -d non_target -o olivar_target/ -p12

Thanks

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions