Use ASR Adapters with Citrinet model #4334
-
Citrinet is compatible with adapters and follows the same steps. You may need to modify the optimization hyperparameters slightly, but it should still work. Can you explain in detail what you tried and what the result was?
-
@manuel3265 This was a fun experiment. There are a few issues with the script you wrote, and an interesting side effect of the older Citrinet checkpoints themselves. A modified copy of your Colab is here: https://colab.research.google.com/drive/1plDHQVucsNr7QqMRCl_EY5E1X0HBrI7z?usp=sharing
Still, the above model got to around 75% WER after 4 epochs. So what was the problem?
There are obviously other optimizations you can try: reduce dropout, increase the LR, reduce or increase the adapter dim, etc. I will leave that exploration to you. Just know that adapters in ASR are relatively new (at least in NeMo). There aren't many empirically well-established practices yet, and each model is bound to have some challenges. But the technique does work well when it works, bringing the WER down from 7% to 3.5% in around 10 minutes of training.
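For anyone comparing the WER figures quoted in this thread, here is a minimal sketch of how word error rate is usually computed: word-level edit distance divided by the number of reference words. This is not NeMo's implementation; the function name `simple_wer` is hypothetical, and it assumes plain whitespace tokenization with no text normalization.

```python
def simple_wer(reference: str, hypothesis: str) -> float:
    """Word error rate: (substitutions + deletions + insertions) / reference words.

    Hypothetical helper for illustration only; assumes whitespace tokenization.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words (classic Levenshtein recurrence).
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words to match an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(
                min(
                    prev[j] + 1,         # deletion (reference word missing)
                    curr[j - 1] + 1,     # insertion (extra hypothesis word)
                    prev[j - 1] + cost,  # match or substitution
                )
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution and one deletion against four reference words.
    print(simple_wer("a b c d", "a x c"))
```

Note that WER can exceed 100% when the hypothesis contains many inserted words, so a value like 75% means the transcript is mostly wrong rather than three quarters right.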
-
Hello, is there someone who can help me use adapters with models other than the Conformer ones? I want to use adapters with the Citrinet models and would like to know whether that is possible and how I could do it.