Replies: 1 comment 4 replies
-
@ninedesu This is an excellent question, and yes, we plan to build a community where people can contribute data for fine-tuning. At the moment, we are gathering all our internal and external datasets (eg https://huggingface.co/datasets/ds4sd/DocLayNet) and preparing them so we can share them all on the huggingface website! With regard to OCR, we have a bit of work to do and are right now relying on 3rd party OCR. |
Beta Was this translation helpful? Give feedback.
4 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I want to know if we can use our own dataset to finetune the OCR
Beta Was this translation helpful? Give feedback.
All reactions