Skip to content

Feature Request: Add Confidence Intervals for Motif Counts (MC) #66

@nlfortier

Description

@nlfortier

The current tandem repeat genotyping tool outputs motif counts (MC) in VCF files but does not provide confidence intervals for these estimates. This would be a valuable addition since:

  • Confidence intervals help users assess call reliability
  • This feature exists in other STR callers (e.g., ExpansionHunter's REPCI field)
  • It would improve consistency with community standards

Example from ExpansionHunter

##FORMAT=<ID=REPCI,Number=1,Type=String,Description="Confidence interval for REPCN">
##FORMAT=<ID=REPCN,Number=1,Type=String,Description="Number of repeat units spanned by the allele">

In the VCF records:

REPCN:REPCI 2/10:2-4/9-12

Suggested Implementation:

  • Add a new FORMAT field (e.g., MCCI) for motif count confidence intervals
  • Format as two integers (min,max) per genotype like ExpansionHunter's REPCI
  • Document the confidence level used (e.g., 95% CI)

Benefits:

  • Enables better quality filtering
  • Facilitates comparison with other tools
  • Helps clinical/lab users make informed decisions

Would this be feasible to implement in a future release?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions