-
Hi, @charlesmindee! Thanks for your great work and library. I am impressed with the accuracy of your trained recognition models on Portuguese texts. However, Portuguese uses some accents that French does not. So I am creating a Portuguese synthetic dataset (10 million words) using SynthTIGER https://arxiv.org/pdf/2107.09313v1.pdf and will try to train a Portuguese recognition model. I have some questions about how you trained the French SAR model. In the SAR paper, the authors looped through groups of dataset samples from both real and synthetic images. My question: did you follow the paper's steps, or did you use only a French synthetic dataset? What dataset size did you use, and for how many epochs did you train? Since your results were superb, I think emulating what you did would be a good start for a Portuguese model. Best regards,
-
Hello @fmobrj 👋 Thanks a lot for your previous PR and your contribution to Portuguese support in docTR! So, for you to train a SAR model on your dataset, here is what you need to do:
Regarding the training hyperparameters, our training set is about the same size as yours, so I'd recommend using the training script defaults for now, apart from the number of epochs (increase it to 15 or 20). Also, please note that while SAR performs well, it is certainly much slower than the CRNN. I would suggest considering training a CRNN instead. Let me know if you have any questions :)
-
Wow. Thank you very much. I will try the CRNN and report my results here as soon as I make progress! Best regards, Fabio.
-
Hi, @fg-mindee! Would you mind sharing the train/test split percentages you used? I will train on the 10 million images I created with the SynthTIGER script, but I am wondering whether holding out 20% (2 million) or even 10% (1 million) for validation is too much. What do you think? Any suggestions? A second question: do you think using differential learning rates would lead to better results when fine-tuning the French checkpoint on the Portuguese dataset and vocab? For example: the default lr for the new linear layer and lr/10 for the rest of the model parameters. Best regards,
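To make the split question concrete, here is a minimal sketch of holding out a fixed fraction of a synthetic dataset for validation. It only assumes the samples can be listed as (filename, label) pairs; the helper name `split_train_val`, the file names, and the labels are hypothetical and not part of docTR or SynthTIGER.

```python
import random


def split_train_val(samples, val_fraction=0.1, seed=42):
    """Shuffle a copy of `samples` and hold out `val_fraction` of it for validation."""
    if not 0.0 < val_fraction < 1.0:
        raise ValueError("val_fraction must be strictly between 0 and 1")
    shuffled = list(samples)
    # A fixed seed keeps the split reproducible across runs.
    random.Random(seed).shuffle(shuffled)
    n_val = int(round(len(shuffled) * val_fraction))
    # The first n_val shuffled samples become the validation set.
    return shuffled[n_val:], shuffled[:n_val]


# Hypothetical (filename, label) pairs standing in for the synthetic images.
samples = [(f"img_{i}.jpg", f"palavra{i}") for i in range(1000)]
train, val = split_train_val(samples, val_fraction=0.1)
print(len(train), len(val))  # 900 100
```

With 10 million samples, the same call with `val_fraction=0.1` would hold out 1 million images, and `val_fraction=0.2` would hold out 2 million.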