How to use talknet-aligner #4337

abhinavdayal · 2022-06-05T15:21:42Z

Hi,

I am exploring the talknet-aligner model:
https://catalog.ngc.nvidia.com/orgs/nvidia/teams/nemo/models/asr_talknet_aligner

ASR-based text/audio aligner based on CTC-loss algorithm that was used to train TalkNet.

I am not sure how to use it: the model page says to refer to the TTS inference instructions, which cover spectrogram generation, but this model is not supposed to generate spectrograms. Any hints on how to use it would be very helpful.

When I call the transcribe method on the model, it does return a phonetic transcription, but I cannot find a way to use it to align a given text with the audio.

Thanks
Abhinav

Answered by redoctopus

redoctopus (Collaborator) · 2022-06-06T21:29:42Z

Hi, the TalkNet Aligner has been deprecated, but if you just want to play around with it, you can check out the "Extracting phoneme ground truth durations" section of this notebook (as of the 1.8.0 release; it's since been removed): https://github.com/NVIDIA/NeMo/blob/r1.8.0/tutorials/tts/TalkNet_Training.ipynb

We'll be moving to the RadTTS Aligner and will upload a checkpoint in the near future.
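For readers wondering how a CTC-trained ASR model can yield an alignment at all, here is a minimal sketch of the general technique (Viterbi forced alignment over the CTC blank-interleaved topology). It is not the code from the notebook and does not use the NeMo API. The function name `ctc_forced_align`, the toy vocabulary, and the synthetic probabilities are all hypothetical; in practice the per-frame log-probabilities would come from the acoustic model and the token ids from the phonemized text.

```python
import numpy as np


def ctc_forced_align(log_probs, tokens, blank=0):
    """Viterbi-align `tokens` to frames under the CTC topology (illustrative sketch).

    log_probs: (num_frames, vocab_size) array of per-frame log-probabilities.
    tokens:    list of target token ids (no blanks).
    Returns the blank-interleaved sequence, the best state per frame,
    and the number of frames spent in each state.
    """
    num_frames = log_probs.shape[0]
    # Interleave blanks: blank, tok1, blank, tok2, ..., blank
    ext = [blank]
    for tok in tokens:
        ext += [tok, blank]
    num_states = len(ext)

    score = np.full((num_frames, num_states), -np.inf)
    back = np.zeros((num_frames, num_states), dtype=int)
    # A CTC path may start in the leading blank or in the first token.
    score[0, 0] = log_probs[0, ext[0]]
    if num_states > 1:
        score[0, 1] = log_probs[0, ext[1]]

    for t in range(1, num_frames):
        for s in range(num_states):
            # Allowed moves: stay, advance by one, or skip a blank
            # between two different tokens.
            cands = [(score[t - 1, s], s)]
            if s >= 1:
                cands.append((score[t - 1, s - 1], s - 1))
            if s >= 2 and ext[s] != blank and ext[s] != ext[s - 2]:
                cands.append((score[t - 1, s - 2], s - 2))
            best, prev = max(cands, key=lambda c: c[0])
            score[t, s] = best + log_probs[t, ext[s]]
            back[t, s] = prev

    # A valid path ends in the trailing blank or in the last token.
    s = num_states - 1
    if num_states > 1 and score[-1, num_states - 2] > score[-1, s]:
        s = num_states - 2
    if not np.isfinite(score[-1, s]):
        raise ValueError("too few frames to align the given tokens")

    # Trace the best path back to the first frame.
    path = [s]
    for t in range(num_frames - 1, 0, -1):
        s = int(back[t, s])
        path.append(s)
    path.reverse()

    durations = np.bincount(path, minlength=num_states)
    return ext, path, durations


# Toy example: vocabulary {0: blank, 1: "a", 2: "b"}, six synthetic frames.
probs = np.full((6, 3), 0.05)
for frame, label in enumerate([0, 1, 1, 0, 2, 2]):
    probs[frame, label] = 0.9

ext, path, durations = ctc_forced_align(np.log(probs), tokens=[1, 2])
token_durations = durations[1::2]  # odd states are the real tokens
blank_durations = durations[0::2]  # even states are the blanks around them
```

The durations are counted in acoustic frames; converting them to seconds depends on the frame hop of whichever model produced the log-probabilities.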
