Replies: 2 comments 9 replies
-
Hey @OriAlpha! I'm not sure to what exactly you are referring to with model definition. Could you please elaborate on what information you need? Then I can help you finding that information. :) |
Beta Was this translation helpful? Give feedback.
8 replies
-
I have tried whole process, but distilled model results in accurcay drop (i.e., around 10%), i tried for training for more epochs and dont see any improvements. any suggestion to improve distilled model performance. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi all,
I am trying to create a distill version of gelectra-base model, as known we need model defination. I could not find in paper not at least full? So is there any way i could achieve this?? or can i import from transformers as config
Beta Was this translation helpful? Give feedback.
All reactions