Numbers should be normalized to text so that the acoustic model is able to spell them out. If the original numbers are required, inverse text normalization should be applied to the output of the ASR model (it's not a 100% match, but it's usually still good).

For reference, raw numbers generally will not transcribe very well, and they lead to errors in the output transcript that are very difficult to diagnose.
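
To make the normalization step concrete, here is a minimal sketch of spelling out digits in a training transcript before it reaches the acoustic model. It is only an illustration: `number_to_words` and `normalize_transcript` are hypothetical helper names, it handles plain non-negative English integers only (no decimals, dates, currency, or ordinals), and a real pipeline would more likely rely on a dedicated text normalization tool.

```python
import re

# Toy illustration only: hypothetical helpers, not part of any ASR toolkit.
ONES = [
    "zero", "one", "two", "three", "four", "five", "six", "seven", "eight",
    "nine", "ten", "eleven", "twelve", "thirteen", "fourteen", "fifteen",
    "sixteen", "seventeen", "eighteen", "nineteen",
]
TENS = ["", "", "twenty", "thirty", "forty", "fifty",
        "sixty", "seventy", "eighty", "ninety"]


def number_to_words(n: int) -> str:
    """Spell out an integer in the range 0..999999 as English words."""
    if n < 0 or n >= 1_000_000:
        raise ValueError("only 0..999999 is supported in this sketch")
    if n < 20:
        return ONES[n]
    if n < 100:
        tens, ones = divmod(n, 10)
        return TENS[tens] if ones == 0 else f"{TENS[tens]} {ONES[ones]}"
    if n < 1000:
        hundreds, rest = divmod(n, 100)
        head = f"{ONES[hundreds]} hundred"
        return head if rest == 0 else f"{head} {number_to_words(rest)}"
    thousands, rest = divmod(n, 1000)
    head = f"{number_to_words(thousands)} thousand"
    return head if rest == 0 else f"{head} {number_to_words(rest)}"


def normalize_transcript(text: str) -> str:
    """Replace every run of ASCII digits with its spelled-out form."""
    def spell(match: re.Match) -> str:
        digits = match.group()
        if len(digits) > 6:
            # Assumption: very long digit runs are read digit by digit.
            return " ".join(ONES[int(d)] for d in digits)
        return number_to_words(int(digits))

    return re.sub(r"[0-9]+", spell, text)


print(normalize_transcript("the meeting starts at 9 and lasts 45 minutes"))
```

The output uses space-separated words without hyphens on the assumption that the model's vocabulary contains only letters and spaces; adjust that if your vocabulary differs.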

Answer selected by jaffe-fly
VahidooX Oct 6, 2021
Collaborator
