Skip to content
Discussion options

You must be logged in to vote

For bucketing you need to pass them as a list of lists like [[],[],[],[]]. Please take a look at the documentation for the right format.
For adaptive, you need to set the train_ds.batch_size=1 and train_ds.bucketing_batch_size to a fixed number for linear scaling or set them manually like this train_ds.bucketing_batch_size = [70,64,56,48,40,32,24,16]. Linear scaling can be aggressive in many cases, so suggest to specify the batch size for each bucket manually like this.
You may set the train_ds.bucketing_strategy=fully_randomized to have lower speedup but probably better accuracy.

Replies: 3 comments

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Answer selected by shuvohishab
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants