A containerized version of Microsoft's markitdown tool that converts various file formats (PDF, Word, PowerPoint, etc.) to Markdown format using Docker or Podman containers.
- 🚀 Easy containerized deployment - No complex Python environment setup required
- 📄 Multiple format support - Convert PDF, DOCX, PPTX, and more to Markdown
- 🐳 Container-based - Uses Docker or Podman for isolated execution
- 🛠️ Simple CLI - Straightforward command-line interface
Before installing Contanizer Markitdown, you'll need one of the following container runtimes:
- Docker (Install Docker)
- Podman (Install Podman)
VibeScript is required to run Contanizer Markitdown. Choose the installation method for your operating system:
Option A: Pre-compiled Binary
curl -L https://github.com/OUIsolutions/VibeScript/releases/download/0.36.0/vibescript.out -o vibescript.out
chmod +x vibescript.out
sudo mv vibescript.out /usr/local/bin/vibescript
Option B: Compile from Source
curl -L https://github.com/OUIsolutions/VibeScript/releases/download/0.36.0/amalgamation.c -o vibescript.c
gcc vibescript.c -o vibescript.out
sudo mv vibescript.out /usr/local/bin/vibescript
curl -L https://github.com/OUIsolutions/VibeScript/releases/download/0.36.0/amalgamation.c -o vibescript.c
gcc vibescript.c -o vibescript.out
sudo mv vibescript.out /usr/local/bin/vibescript
Note: Make sure you have GCC installed. You can install it via Xcode Command Line Tools:
xcode-select --install
Once VibeScript is installed, add the markitdown script:
vibescript add_script --file https://github.com/OUIsolutions/Contanizer-Markitdown/releases/download/0.1.0/markitdown.lua markitdown
Choose your preferred container runtime:
For Docker users:
markitdown set_provider docker
For Podman users:
markitdown set_provider podman
Convert any supported file to Markdown:
markitdown your-file.pdf > output.md
Convert a PDF document:
markitdown document.pdf > document.md
Convert a Word document:
markitdown presentation.docx > presentation.md
Convert a PowerPoint presentation:
markitdown slides.pptx > slides.md
Contanizer Markitdown supports the same file formats as the original markitdown tool, including:
- PDF (
.pdf
) - Microsoft Word (
.docx
,.doc
) - Microsoft PowerPoint (
.pptx
,.ppt
) - Microsoft Excel (
.xlsx
,.xls
) - Images with OCR support
- And more...
-
"Provider not set" error: Make sure you've configured your container provider using
markitdown set_provider <docker|podman>
-
Permission denied: Ensure Docker/Podman is running and your user has the necessary permissions
-
Command not found: Verify that VibeScript is properly installed and in your PATH
If you encounter issues, please check the Issues page or create a new issue with details about your problem.
We welcome contributions! Please feel free to submit a Pull Request.
This project is licensed under the same terms as the original markitdown project. See the LICENSE file for details.
- Based on Microsoft's markitdown project
- Built with VibeScript