Replies: 3 comments 4 replies
-
See the class:
class ImageRefMode(str, Enum):
    """ImageRefMode."""
    PLACEHOLDER = "placeholder"  # just a placeholder
    EMBEDDED = "embedded"  # embed the image as base64
    REFERENCED = "referenced"  # reference the image via URI
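To make the three modes concrete, here is a rough, self-contained sketch of what each one means for the Markdown output. It redefines the enum locally so it runs without Docling, and `render_image_ref` is a hypothetical helper written for illustration, not Docling's actual renderer; the exact strings Docling emits may differ.

```python
import base64
from enum import Enum


class ImageRefMode(str, Enum):
    """Local copy of the enum quoted above, so this sketch runs without Docling."""

    PLACEHOLDER = "placeholder"
    EMBEDDED = "embedded"
    REFERENCED = "referenced"


def render_image_ref(mode: ImageRefMode, png_bytes: bytes, uri: str) -> str:
    """Hypothetical helper: roughly what each mode means for the Markdown text."""
    if mode is ImageRefMode.PLACEHOLDER:
        # No image data at all, only a marker where the picture was.
        return "<!-- image -->"
    if mode is ImageRefMode.EMBEDDED:
        # The image bytes travel inside the Markdown as a base64 data URI.
        payload = base64.b64encode(png_bytes).decode("ascii")
        return f"![Image](data:image/png;base64,{payload})"
    # REFERENCED: the Markdown only points at an image stored elsewhere.
    return f"![Image]({uri})"
```

So if the output only contains the marker, the export is most likely running in placeholder mode.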
-
When you export the Markdown with doc.export_to_markdown(), you can specify the image_mode:
full_markdown = doc.export_to_markdown(
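For reference, a minimal end-to-end sketch of that export might look like the following. The wrapper function `pdf_to_markdown_with_images` and its `embed` flag are hypothetical names for illustration. It also assumes that picture images have to be generated during conversion (`generate_picture_images = True`) before there is anything to embed, and that `images_scale = 2.0` is just an example resolution.

```python
def pdf_to_markdown_with_images(pdf_path: str, embed: bool = True) -> str:
    """Convert a PDF and export Markdown with images instead of placeholders."""
    # Imports are local so this module loads even where Docling is not installed.
    from docling.datamodel.base_models import InputFormat
    from docling.datamodel.pipeline_options import PdfPipelineOptions
    from docling.document_converter import DocumentConverter, PdfFormatOption
    from docling_core.types.doc import ImageRefMode

    pipeline_options = PdfPipelineOptions()
    # Assumption: pictures must be rendered at conversion time,
    # otherwise there is no image data to embed at export time.
    pipeline_options.generate_picture_images = True
    pipeline_options.images_scale = 2.0  # example value, not a requirement

    converter = DocumentConverter(
        format_options={
            InputFormat.PDF: PdfFormatOption(pipeline_options=pipeline_options)
        }
    )
    doc = converter.convert(pdf_path).document

    # EMBEDDED inlines base64 data; PLACEHOLDER keeps only a marker.
    mode = ImageRefMode.EMBEDDED if embed else ImageRefMode.PLACEHOLDER
    return doc.export_to_markdown(image_mode=mode)
```

Calling `pdf_to_markdown_with_images("input.pdf")` should then return Markdown with the images inlined. If you would rather have separate image files, `ImageRefMode.REFERENCED` is the mode to look at.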
-
@itsyaboyksi @sreena-certaintiai @itsyaboyksi
This gives you access to the image blocks and any OCR-extracted text inside them.
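One way to reach the image blocks, sketched under assumptions: it expects an already converted Docling document `doc` whose `pictures` list holds items with a `get_image(doc)` method returning a PIL-style image (or `None` when no image was generated). The helper name `save_picture_images` and the file naming are made up for illustration, and the OCR text is not touched here.

```python
from pathlib import Path


def save_picture_images(doc, out_dir: str) -> list:
    """Write every picture of a converted document to out_dir as PNG files."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)

    saved = []
    for index, picture in enumerate(doc.pictures, start=1):
        # Assumption: get_image returns None if picture images were not
        # generated during conversion, so those items are skipped.
        image = picture.get_image(doc)
        if image is None:
            continue
        target = out / f"picture-{index}.png"
        image.save(target, "PNG")
        saved.append(target)
    return saved
```

If the returned list is empty, the pictures were probably never rendered during conversion, which would also explain placeholders in the Markdown.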
-
I am trying to convert a PDF to Markdown using Docling, but instead of the extracted images I get a <!-- image --> placeholder in the output.
I set
pipeline_options.do_ocr = True
to enable OCR.