how process Mandarin ASR datasets? #2913

jaffe-fly · 2021-09-28T07:10:50Z

jaffe-fly
Sep 28, 2021

https://github.com/NVIDIA/NeMo/blob/main/examples/asr/conf/quartznet/quartznet_15x5_zh.yaml

why yaml labels has no number （1,2,3....）

when number in the datasets,do i need remove?

Answered by titu1994

Sep 28, 2021

Numbers should be normalized to text, so that the acoustic model is able to spell it out. Inverse text normalization should be used on the output of the ASR model if original numbers are required (it's not a 100% match but it's usually still good).

For reference, raw numbers generally will not transcribe very well and will have very difficult to diagnose errors in the output transcript

View full answer

titu1994 · 2021-09-28T08:17:08Z

titu1994
Sep 28, 2021
Maintainer

Numbers should be normalized to text, so that the acoustic model is able to spell it out. Inverse text normalization should be used on the output of the ASR model if original numbers are required (it's not a 100% match but it's usually still good).

For reference, raw numbers generally will not transcribe very well and will have very difficult to diagnose errors in the output transcript

1 reply

jaffe-fly Sep 28, 2021
Author

Numbers should be normalized to text, so that the acoustic model is able to spell it out. Inverse text normalization should be used on the output of the ASR model if original numbers are required (it's not a 100% match but it's usually still good).

For reference, raw numbers generally will not transcribe very well and will have very difficult to diagnose errors in the output transcript
thank you !!!
which tutorials can i see Numbers normalized to text and Inverse text normalization should be used on the output of the ASR model ?
I dont know how to do this.

jaffe-fly · 2021-09-28T08:57:53Z

jaffe-fly
Sep 28, 2021
Author

when dump to json file do i need set like this json.dump(metadata, fout, ensure_ascii=False),ensure_ascii=False in the func,?

2 replies

VahidooX Oct 6, 2021
Collaborator

I don't think you need to set ensure_ascii=False.

jaffe-fly Oct 8, 2021
Author

ok thank you

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

how process Mandarin ASR datasets? #2913

Uh oh!

{{title}}

Uh oh!

Replies: 2 comments 3 replies

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

how process Mandarin ASR datasets? #2913

Uh oh!

jaffe-fly Sep 28, 2021

Replies: 2 comments · 3 replies

Uh oh!

Uh oh!

titu1994 Sep 28, 2021 Maintainer

Uh oh!

jaffe-fly Sep 28, 2021 Author

Uh oh!

jaffe-fly Sep 28, 2021 Author

Uh oh!

VahidooX Oct 6, 2021 Collaborator

Uh oh!

jaffe-fly Oct 8, 2021 Author

jaffe-fly
Sep 28, 2021

Replies: 2 comments 3 replies

titu1994
Sep 28, 2021
Maintainer

jaffe-fly Sep 28, 2021
Author

jaffe-fly
Sep 28, 2021
Author

VahidooX Oct 6, 2021
Collaborator

jaffe-fly Oct 8, 2021
Author