Skip to content
#

kannada-text-ocr

Here is 1 public repository matching this topic...

This project extracts handwritten Kannada text from PDF images using PyMuPDF and Tesseract OCR. It processes images for better contrast using OpenCV, improving text recognition accuracy. The script also filters out unwanted patterns like URLs and digits, ensuring clean output. This tool is ideal for digitizing Kannada handwritten documents.

  • Updated Sep 19, 2024
  • Python

Improve this page

Add a description, image, and links to the kannada-text-ocr topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the kannada-text-ocr topic, visit your repo's landing page and select "manage topics."

Learn more