Skip to content

Low performance when try to recognize a long string of digits and punctuation #450

Answered by charlesmindee
mouhannadali asked this question in Q&A
Discussion options

You must be logged in to vote

Hi @mouhannadali, Thanks for reporting this.
I guess you used our baseline predictor, which is running a CRNN model for sequence recognition. This model is composed of a 2 LSTM decoder and optimized with a CTC loss, which is less robust on long sequences. This can explain the satisfying results of our model on 2 medium sequences, but the lack of robustness when it has to deal with long sequences.
You may try with attention models (SAR, MASTER) in the recognition predictor, it can lead to better results.
We will focus on lighter attention models in the near future to deal with long sequences, thanks for having reported this. Note that the segmentation seems right because the 2 first lines …

Replies: 1 comment 4 replies

Comment options

You must be logged in to vote
4 replies
@mouhannadali
Comment options

@charlesmindee
Comment options

@charlesmindee
Comment options

@mouhannadali
Comment options

Answer selected by charlesmindee
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants