Minimum LibreOffice packages to parse files #2610
Unanswered
nikoszaf41
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
I am looking into using unstructured to parse
.doc
files in an application that will be running in a docker container.Since I want the image to have the smallest possible size i've been searching to find the LibreOffice packages for
ppt
anddoc
partitioning.What are the minimum required packages I need to install in the container for this
doc
todocx
conversion to run?soffice --headless --convert-to docx example.doc
I've checked
libreoffice-nogui
but it also includes many packages not required for these files typesThe base image I'm using is
python:3.10-slim
Beta Was this translation helpful? Give feedback.
All reactions