monkey-ocr

Here is 1 public repository matching this topic...

PRITHIVSAKTHIUR / Multimodal-OCR

Vision-language model tailored for tasks that involve [messy] optical character recognition (OCR), image-to-text conversion, and math problem solving with LaTeX formatting.

pillow video-processing opencv-python video-understanding ocr-recognition ocr-python huggingface-transformers qwen2-vl-2b qwen2-5-vl monkey-ocr

Updated Jul 26, 2025
Python


