How docling differentiates between scanned image-page and embedded image #1540
Unanswered
mudassir206
asked this question in
Q&A
Replies: 1 comment
-
@dolfim-ibm --i will be grateful if you can look into it |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
we have been working on arabic pdf files.The docling pipeline is ok at the moment, but we are looking for some configurations which enhances text extraction from pdf.
grateful, if anyone could able to answer this.
The goal is
1.differentiate between scanned image-page and embedded imagetract
2.extract text from embedded image
Beta Was this translation helpful? Give feedback.
All reactions