Replies: 2 comments
-
My understanding from the way residual connections work for this model (Marblenet, https://arxiv.org/pdf/2010.13886.pdf) is that at the end of the block, we should have
which is a list of tensors, not just one tensor. |
Beta Was this translation helpful? Give feedback.
0 replies
-
Sorry, my mistake I had |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hello,
I am trying to load the weights of a model I have trained with nemo, but using only pytorch and torchaudio (as it includes the mfcc extractor as part of the full model). The model is based on Jasper blocks, but I am facing an issue for the blocks with residual connections.
If I use the existing
JasperBlock
andConvASREncoder
implementation, I am getting this for the first block with residual=true:while in the loaded model (using nemo loading function) I am getting this
so basically the difference is trying to use the version without nemo there is this additional part:
I am able to get the "correct" architecture with a hack on the function which builds JasperBlock. Before this line: https://github.com/NVIDIA/NeMo/blob/c1abc047d1d5962019f7789b20d8092c7e55b690/nemo/collections/asr/parts/submodules/jasper.py#LL810C33-L810C33
I am adding the following:
which helps getting the same structure in all the blocks as "expected".
The issue is then, I am able to load the model but the inference fails with this:
Could you help me understand what is happening and how to use the correct architectures which lets me load the model and do inference with it?
Beta Was this translation helpful? Give feedback.
All reactions