-
I am trying to train an ASR model on 2 TB of data. It takes around 50 minutes before the actual training starts. Can this be made faster? I'm using 16 A100 40 GB GPUs. Do you have any suggestions for training an ASR model on 2 TB of data so that it uses the GPUs efficiently and trains faster? I'm using the Thanks
Replies: 2 comments 4 replies
-
The startup time is spent on manifest processing. You can try using sharded manifests. The feature is still under testing and not fully documented yet, but you can use the following docs: https://docs.nvidia.com/deeplearning/nemo/user-guide/docs/en/main/asr/datasets.html#tarred-datasets (Ctrl-F "sharded manifest").
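To illustrate the idea only: a sharded manifest splits one large JSON-lines manifest into one small manifest per shard, so each worker parses only its own part at startup. The sketch below is plain Python, not NeMo's own tooling. The function name `shard_manifest`, the output naming `manifest_<n>.json`, and the `shard_id` field are assumptions made for illustration; check the linked docs for the layout NeMo actually expects.

```python
import json
from collections import defaultdict
from pathlib import Path


def shard_manifest(manifest_path, out_dir, num_shards):
    """Split a JSON-lines manifest into one smaller manifest per shard.

    Hypothetical helper for illustration; not part of NeMo.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    buckets = defaultdict(list)
    with open(manifest_path, encoding="utf-8") as f:
        for index, line in enumerate(f):
            line = line.strip()
            if not line:
                continue  # skip blank lines
            entry = json.loads(line)
            # Use an existing "shard_id" if present (assumed field name),
            # otherwise assign entries round-robin by line index.
            shard = int(entry.get("shard_id", index % num_shards))
            if not 0 <= shard < num_shards:
                raise ValueError(f"shard id {shard} outside 0..{num_shards - 1}")
            buckets[shard].append(entry)

    paths = []
    for shard in range(num_shards):
        # Output file naming is an assumption, chosen for readability.
        path = out_dir / f"manifest_{shard}.json"
        with open(path, "w", encoding="utf-8") as out:
            for entry in buckets.get(shard, []):
                out.write(json.dumps(entry) + "\n")
        paths.append(path)
    return paths
```

For example, `shard_manifest("train_manifest.json", "sharded/", 512)` would write 512 small manifests under `sharded/`; both the input file name and the shard count here are placeholders.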
-
I also suggest using FastConformer models instead of regular Conformer. They give you more than a 2x speedup in training and inference.