Skip to content

Japanese & Korean OCRs are wildly innacurate #7708

Answered by coastal45
thealexmay asked this question in Q&A
Discussion options

You must be logged in to vote

The Tesseract Manual page is here: TESSERACT(1) Manual Page As you can see, there are many options. At least you need to specify the language, and 8-bit black on clean white background works best. I don't know any Korean, but it seems I read before that characters with vertical lines on the left side (common in Hangul) do not OCR well.
If you have a specific problem, you can bring it up on the Tesseract issues page.

Replies: 2 comments 2 replies

Comment options

You must be logged in to vote
1 reply
@thealexmay
Comment options

Comment options

You must be logged in to vote
1 reply
@thealexmay
Comment options

Answer selected by thealexmay
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants