Performance metrics for evaluating time segmentation of Whisper models #182
Unanswered
mafaisalpg
asked this question in
Q&A
Replies: 1 comment 1 reply
-
Yes, we studied how to evaluate timestamps quality. Unfortunately there is no open source code to compute those metrics. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hello everyone,
I like to compare different Whisper models on my own datasets changing some parameters like detect_disfluencies, etc. My output looks like
start, end, text
instead of word level timestamp.
Questions
What are metrics (MSE, MAE, etc.) best fit for this purpose?
Is there any Python implementation which I can reuse?
Is there any paper or article have done such comparison?
Thanks
Beta Was this translation helpful? Give feedback.
All reactions