Replies: 1 comment
You could try converting a specific page with page_range.
In my case, page numbers aren't easy to detect when converting the whole PDF at once. I use page_range to convert one page at a time, add the page number to the resulting Markdown, and then repeat the same process for the remaining pages.
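For anyone who wants to try this, here is a minimal sketch of the one-page-at-a-time loop, not a definitive implementation. It assumes that `DocumentConverter.convert` accepts `page_range` as an inclusive `(start, end)` tuple of 1-based page numbers, and that you already know how many pages the PDF has. The helper names, the `num_pages` parameter, and the `<!-- page N -->` marker format are my own choices for illustration and not part of Docling.

```python
from functools import lru_cache
from typing import Callable


@lru_cache(maxsize=1)
def _get_converter():
    """Create the Docling converter once and reuse it for every page."""
    # Imported lazily so the rest of this sketch runs without Docling installed.
    from docling.document_converter import DocumentConverter

    return DocumentConverter()


def docling_page_to_markdown(pdf_path: str, page_no: int) -> str:
    """Convert a single page (1-based) of a PDF to Markdown with Docling."""
    # Assumption: page_range is an inclusive (start, end) tuple of 1-based pages.
    result = _get_converter().convert(pdf_path, page_range=(page_no, page_no))
    return result.document.export_to_markdown()


def convert_with_page_markers(
    pdf_path: str,
    num_pages: int,
    page_to_markdown: Callable[[str, int], str] = docling_page_to_markdown,
) -> str:
    """Convert pages 1..num_pages one at a time and label each with its number."""
    sections = []
    for page_no in range(1, num_pages + 1):
        markdown = page_to_markdown(pdf_path, page_no).strip()
        # The marker format is arbitrary; use whatever your downstream code expects.
        sections.append(f"<!-- page {page_no} -->\n{markdown}")
    return "\n\n".join(sections)
```

You would call it as `convert_with_page_markers("report.pdf", num_pages=12)`, where the file name and page count are placeholders. Converting page by page is slower than a single pass, but each chunk of Markdown is then tied to exactly one page.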
Hi community,
I have a question regarding PDF processing with Docling. I'm trying to convert a PDF and would like to retain the page number information for the extracted content.
Is there a way to either:
- Extract content page by page, or
- Have the extracted text indicate which page each section belongs to?
Any guidance or suggestions would be greatly appreciated. Thanks in advance!
https://docling-project.github.io/docling/reference/document_converter/