Skip to content

Poly(A) tail estimation post basecalling #1475

@danielcav

Description

@danielcav

New issue checks

Dorado subcommand

Summary

Feature request

Hello,

I would like to estimate poly(A) tail lengths from my nanopore data. I noticed that the basecaller has the --estimate-poly-a option, but currently this has to be specified at basecalling time. Since basecalling large datasets can take more than a day, it would be very useful to have the option to run poly(A) estimation after basecalling, without having to re-run the entire basecalling process.

Does this functionality already exist in Dorado?

If not, would it be feasible to add a standalone poly(A) estimation step that works on existing Dorado outputs (FASTQ/BAM/POD5)?

I have also looked into Nanopolish and Tailfindr, but both rely on FAST5 input, which is now obsolete, I guess. Are there any existing tools, or plans to adapt similar approaches, that can work directly with POD5 and Dorado outputs?

Thank you very much for your time and for developing Dorado!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions