Description of feature
When you run taxprofiler with MetaPhlAn as the profiler, you can obtain estimated read counts by adding the parameter -t rel_ab_w_read_stats to the db_params column in databases.csv. The resulting profile files then contain an extra column, "estimated_number_of_reads_from_the_clade".

If you run the pipeline with --run_profile_standardisation, you get a combined report, metaphlan_db1_combined_reports.txt, with the relative abundances of all the samples. I think it would be nice to also get a combined report of the estimated counts. Besides, when you run the pipeline with both -t rel_ab_w_read_stats and --run_profile_standardisation, you get this error in taxpasta_merge (extended report in the screenshot below):
Caused by:
Process NFCORE_TAXPROFILER:TAXPROFILER:STANDARDISATION_PROFILES:TAXPASTA_MERGE (metaphlan|db1) terminated with an error exit status (1)
...
Command output:
[15:19:40] CRITICAL Error in sample '250703_S1_db1.metaphlan_profile' with profile '250703_S1_db1.metaphlan_profile.txt'.  merge.py:419
CRITICAL Unexpected report format. It has 5 columns but only 4 are expected.  merge.py:424
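
To make the requested combined report of estimated counts concrete, here is a minimal sketch of what such a merge could look like when done by hand outside the pipeline. It is not part of taxprofiler or taxpasta. It assumes each profile is tab-separated, that the column names sit on a comment line starting with "#", and that the first column holds the clade name; only the column name "estimated_number_of_reads_from_the_clade" comes from the report above. The function names, sample names, and example rows are hypothetical.

```python
import io

# Column name as reported above; everything else in this sketch is an assumption.
COUNT_COLUMN = "estimated_number_of_reads_from_the_clade"


def read_counts(handle):
    """Return {clade_name: estimated read count} for one profile.

    Assumes a tab-separated file whose column names are on a '#' comment
    line and whose first column is the clade name.
    """
    header = None
    counts = {}
    for line in handle:
        line = line.rstrip("\n")
        if not line:
            continue
        if line.startswith("#"):
            fields = line.lstrip("#").split("\t")
            # Only the comment line that names the count column is the header.
            if COUNT_COLUMN in fields:
                header = fields
            continue
        if header is None:
            raise ValueError(f"no header line naming {COUNT_COLUMN!r} found")
        row = line.split("\t")
        counts[row[0]] = int(row[header.index(COUNT_COLUMN)])
    return counts


def merge_counts(profiles):
    """Combine {sample: {clade: count}} into rows, header row first.

    Clades missing from a sample are filled with 0.
    """
    samples = sorted(profiles)
    clades = sorted({clade for counts in profiles.values() for clade in counts})
    rows = [["clade_name", *samples]]
    for clade in clades:
        rows.append([clade, *(profiles[s].get(clade, 0) for s in samples)])
    return rows


# Hypothetical two-sample example using in-memory text instead of real files.
HEADER = "#clade_name\tclade_taxid\trelative_abundance\tcoverage\t" + COUNT_COLUMN + "\n"
sample_a = io.StringIO(HEADER + "k__Bacteria\t2\t100.0\t0.5\t120\n")
sample_b = io.StringIO(HEADER + "k__Archaea\t2157\t100.0\t0.1\t7\n")

combined = merge_counts({
    "sample_a": read_counts(sample_a),
    "sample_b": read_counts(sample_b),
})
for row in combined:
    print("\t".join(str(value) for value in row))
```

Looking the column up by name rather than by position means the sketch does not care how many columns a profile has, which is the mismatch the error above complains about.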

Discussed with @jfy133 in the taxprofiler Slack channel.