How should adaptive bucketing work? #3302
-
As I understand it, for bucketing to work I have to provide multiple manifests and the same number of tarred_audio_filepaths. A BucketingDataset with a different batch size is then created for each manifest / tarred_audio_filepath pair, right? If that is correct, I don't understand how or why this would make training faster.
Replies: 1 comment 4 replies
-
When you use bucketing, audio files of similar length end up in the same batch, so there is less padding, and training is faster because the max_length in a batch is smaller on average. Adaptive bucketing makes this even more efficient: buckets with shorter audio get larger batches, which utilizes the GPUs more efficiently, so each epoch finishes faster. It can give more than a 2x speedup in training.
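To make the padding argument concrete, here is a small toy simulation in plain Python. It is only a sketch, not how BucketingDataset is implemented: the helper names (`padded_total`, `chunk`, `bucketed_batches`), the synthetic durations, and the rule used to pick the adaptive batch size (scale the batch size so that no batch exceeds the padded size of a full batch of the longest audio) are all assumptions made for illustration.

```python
import random


def padded_total(batches):
    """Seconds of audio processed when each batch is padded to its longest item."""
    return sum(max(batch) * len(batch) for batch in batches)


def chunk(items, size):
    """Split a list into consecutive batches of at most `size` items."""
    return [items[i:i + size] for i in range(0, len(items), size)]


def bucketed_batches(durations, num_buckets, batch_size, rng, adaptive=False):
    """Group durations into equally sized buckets by length, then batch per bucket.

    With adaptive=True, a bucket of shorter audio gets a larger batch size
    (hypothetical rule: keep batch_size * longest_duration as the padded budget).
    """
    ordered = sorted(durations)
    per_bucket = -(-len(ordered) // num_buckets)  # ceiling division
    longest = ordered[-1]
    batches = []
    for bucket in chunk(ordered, per_bucket):
        size = batch_size
        if adaptive:
            size = max(batch_size, int(batch_size * longest / max(bucket)))
        rng.shuffle(bucket)  # batches are still random within a bucket
        batches.extend(chunk(bucket, size))
    return batches


rng = random.Random(0)
# Synthetic utterance durations in seconds, in random order (assumed data).
durations = [rng.uniform(1.0, 20.0) for _ in range(4000)]

plain = chunk(durations, 16)
bucketed = bucketed_batches(durations, 4, 16, rng)
adaptive = bucketed_batches(durations, 4, 16, rng, adaptive=True)

for name, batches in [("plain", plain), ("bucketed", bucketed), ("adaptive", adaptive)]:
    print(name, "batches:", len(batches), "padded seconds:", round(padded_total(batches)))
```

In this toy setup, bucketing reduces the total padded audio per epoch, and the adaptive variant additionally reduces the number of batches (steps) per epoch while keeping every batch within the same padded budget. The actual speedup on real data depends on the duration distribution and on how the batch sizes are chosen.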