-
https://github.com/NVIDIA/NeMo/blob/main/examples/asr/conf/quartznet/quartznet_15x5_zh.yaml why yaml labels has no number (1,2,3....) when number in the datasets,do i need remove? |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 3 replies
-
Numbers should be normalized to text, so that the acoustic model is able to spell it out. Inverse text normalization should be used on the output of the ASR model if original numbers are required (it's not a 100% match but it's usually still good). For reference, raw numbers generally will not transcribe very well and will have very difficult to diagnose errors in the output transcript |
Beta Was this translation helpful? Give feedback.
-
when dump to json file do i need set like this |
Beta Was this translation helpful? Give feedback.
Numbers should be normalized to text, so that the acoustic model is able to spell it out. Inverse text normalization should be used on the output of the ASR model if original numbers are required (it's not a 100% match but it's usually still good).
For reference, raw numbers generally will not transcribe very well and will have very difficult to diagnose errors in the output transcript