Version 0.0.20 has been released #243
JorjMcKie
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
This fixes some more bugs.
Here is an overview of other changes:
ignore_images
andignore_graphics
can be used to omit these objects when determining the page layout and will be omitted in the output. Can be helpful with overly crowded pages.table_strategy
can now beNone
to suppress execution of PyMuPDF's table finder. This can significantly increase conversion speed.max_levels
(an integer between 1 and 6) limits the number of detected header levels. Here is a code snippet:The output produced above will only have headers up to level 3 ("###"). All text with a smaller font size than that of level 3 will be treated as body text.
The complete changes are here.
Beta Was this translation helpful? Give feedback.
All reactions