CTC beam search without language model #4064

hamjam · 2022-04-26T08:15:27Z

hamjam
Apr 26, 2022

Hi and thank you for your great toolkit,

@titu1994
It seems that there is not any CTC decoding strategy for CTC models except using BeamSearchDecoderWithLM class or greedy strategy. First, there is no mention in the BeamSearchDecoderWithLM's docstring for how to perform beam search without lm. Then, the class just uses ctc_decoders from OpenSeq2Seq and it doesn't implement beam search directly for more flexibility. Is there any limitation to mentioned problems? Is it possible to implement CTC beam search without language modeling?

Thanks in advance

VahidooX · 2022-04-26T19:13:06Z

VahidooX
Apr 26, 2022
Collaborator

ctc_decoders is a very fast implementation of beam search decoding. You can use them with or without LM. You may just pass None as the lm_path to have a regular beam search decoding. You may take a look here in the docs and the scripts to learn more on the detail:

https://github.com/NVIDIA/NeMo/blob/main/scripts/asr_language_modeling/ngram_lm/eval_beamsearch_ngram.py

https://docs.nvidia.com/deeplearning/nemo/user-guide/docs/en/main/asr/asr_language_modeling.html

If you are looking to make changes to the beam search decoding, then you can use this library which is fully pythonic and works with nemo models:

https://github.com/kensho-technologies/pyctcdecode

https://blog.kensho.com/pyctcdecode-a-new-beam-search-decoder-for-ctc-speech-recognition-2be3863afa96

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

CTC beam search without language model #4064

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

CTC beam search without language model #4064

Uh oh!

Uh oh!

hamjam Apr 26, 2022

Replies: 1 comment

Uh oh!

VahidooX Apr 26, 2022 Collaborator

hamjam
Apr 26, 2022

VahidooX
Apr 26, 2022
Collaborator