Is it possible to loading ASR data faster for ~20k hr data and make training faster? #6825

mehadi92 · 2023-06-07T04:33:38Z

mehadi92
Jun 7, 2023

I try to train the ASR model with 2TB of data. It takes around 50 min to start actual training. Could we make it faster? I'm using 16 a100 40 GB GPU.

Do you have any suggestions for training the ASR model with 2TB data to efficiently use GPU and faster training?

I'm using the conformer transducer medium model and it was taking ~6hr for 1 epoch without data bucketing.
I'm already using mixed precession(bf16)

Thanks

Answered by titu1994

Jun 7, 2023

Startup time is for manifest processing. You can try using sharded manifests, but it is under testing and not fully documented yet but you can use the following docs - https://docs.nvidia.com/deeplearning/nemo/user-guide/docs/en/main/asr/datasets.html#tarred-datasets

Ctrf-F "sharded manifest"

View full answer

titu1994 · 2023-06-07T20:22:30Z

titu1994
Jun 7, 2023
Maintainer

Startup time is for manifest processing. You can try using sharded manifests, but it is under testing and not fully documented yet but you can use the following docs - https://docs.nvidia.com/deeplearning/nemo/user-guide/docs/en/main/asr/datasets.html#tarred-datasets

Ctrf-F "sharded manifest"

0 replies

VahidooX · 2023-06-15T03:58:30Z

VahidooX
Jun 15, 2023
Collaborator

I also suggest to use FastConformer models instead of regular Conformer. It gives you more than 2x speedup in training and inference.

4 replies

mehadi92 Jun 15, 2023
Author

@VahidooX YES fast conformer is 2X faster

mehadi92 Jun 15, 2023
Author

But there is no pre-train of a fast conformer model with 128 tokenizers. All fast-conformer model comes with 1024 tokens

VahidooX Jun 15, 2023
Collaborator

What is the problem with vocab size 1024?

mehadi92 Jun 15, 2023
Author

@VahidooX Not much problem with vocab size. Need to create a new tokenize with vocab size 1024. Could not use the same vocab that I can use in conformer ctc models.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Is it possible to loading ASR data faster for ~20k hr data and make training faster? #6825

Uh oh!

{{title}}

Uh oh!

Replies: 2 comments 4 replies

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Is it possible to loading ASR data faster for ~20k hr data and make training faster? #6825

Uh oh!

mehadi92 Jun 7, 2023

Replies: 2 comments · 4 replies

Uh oh!

titu1994 Jun 7, 2023 Maintainer

Uh oh!

VahidooX Jun 15, 2023 Collaborator

Uh oh!

mehadi92 Jun 15, 2023 Author

Uh oh!

mehadi92 Jun 15, 2023 Author

Uh oh!

VahidooX Jun 15, 2023 Collaborator

Uh oh!

mehadi92 Jun 15, 2023 Author

mehadi92
Jun 7, 2023

Replies: 2 comments 4 replies

titu1994
Jun 7, 2023
Maintainer

VahidooX
Jun 15, 2023
Collaborator

mehadi92 Jun 15, 2023
Author

mehadi92 Jun 15, 2023
Author

VahidooX Jun 15, 2023
Collaborator

mehadi92 Jun 15, 2023
Author