Hyperparameter suggestions #867
-
Hi! I am interested in trying different hyperparameters in fine-tuning for T5-base. After checking the doc, I found a gin file with many parameters. I was wondering where may I find a list of hyperparameters (and some good initial values) to tune on? Currently, I have the below list based on the gin file.
Could you please give me some suggestions on hyperparameter suggestions? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 2 replies
-
I have only ever observed marginal gains from using anything other than the default suggested fine-tuning hyperparameter settings (constant LR = 0.001, dropout = 0.1, etc). |
Beta Was this translation helpful? Give feedback.
I have only ever observed marginal gains from using anything other than the default suggested fine-tuning hyperparameter settings (constant LR = 0.001, dropout = 0.1, etc).