Low performance when try to recognize a long string of digits and punctuation #450
-
I run the model on the following image: and I got the following output when the model is considering the one string as 2 different strings then the recognition is correct otherwise I am getting the wrong output. I test the same image on Amazon Textract and the output was correct. any idea what can be done here? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 4 replies
-
Hi @mouhannadali, Thanks for reporting this. |
Beta Was this translation helpful? Give feedback.
Hi @mouhannadali, Thanks for reporting this.
I guess you used our baseline predictor, which is running a CRNN model for sequence recognition. This model is composed of a 2 LSTM decoder and optimized with a CTC loss, which is less robust on long sequences. This can explain the satisfying results of our model on 2 medium sequences, but the lack of robustness when it has to deal with long sequences.
You may try with attention models (SAR, MASTER) in the recognition predictor, it can lead to better results.
We will focus on lighter attention models in the near future to deal with long sequences, thanks for having reported this. Note that the segmentation seems right because the 2 first lines …