Skip to content

Providing a combined report for estimated counts table for Metaphlan #638

@apalleja

Description

@apalleja

Description of feature

When you run taxprofiler annotating with Metaphlan you can obtain the estimated counts by adding the following parameter: -t rel_ab_w_read_stats in db_params column in databases.csv. Then you obtain the profile files containing an extra column "estimated_number_of_reads_from_the_clade".

Image

If you run the pipeline with --run_profile_standardisation then you get a combined report: metaphlan_db1_combined_reports.txt with the relative abundances of all the samples. I think it would be nice to also get a combined report of the estimated counts. Besides when you run the pipeline with -t rel_ab_w_read_stats and -run_profile_standardisation you get this error in the taxpasta_merge (extended report in the screenshot below):

Caused by:
Process NFCORE_TAXPROFILER:TAXPROFILER:STANDARDISATION_PROFILES:TAXPASTA_MERGE (metaphlan|db1) terminated with an error exit status (1)
...

Command output:
[15:19:40] CRITICAL Error in sample merge.py:419
'250703_S1_db1.metaphlan_profile'
with profile
'250703_S1_db1.metaphlan_profile.
txt'.
CRITICAL Unexpected report format. It has 5 columns but merge.py:424
only 4 are expected.

Image

Discussed with @jfy133 at the taxprofiler Slack channel.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementImprovement for existing functionality

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions