Open
Description
I'm wondering why the recognition results differ between standalone tesseract5.3.0 and OCR-D with ocrd-olena-binarize && ocrd-tesserocr-segment.
Original TIF: https://digi.ub.uni-heidelberg.de/diglitData/v/heidelberg1592_-_04manual.tif
Result using tesseract5.3.0 -l Fraktur_GT4Hist...
(right column = ground truth)
and using tesserocr-segment and calamari-recognize (fraktur_historical1.0) with OCR-D:
and using tesserocr-segment and tesserocr-recognize (Fraktur_GT4Hist...) with OCR-D:
It seems that the OCR-D "tesserocr" segmentation is somewhat different from the standalone tesseract segmentation (perhaps because of olena-binarize?), but I can't find any big change to line/region segmentation etc. in the tesseract changelog from the last year.
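To put a number on the differences between the three outputs, each one could be compared with the ground truth using a character error rate (CER). The following is only a minimal sketch, assuming each result and the ground truth are available as plain text; the function names `levenshtein` and `cer` are made up for this illustration and are not part of tesseract or OCR-D.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance between a and b (insert, delete, substitute each cost 1)."""
    # previous[j] holds the distance between the first i-1 chars of a
    # and the first j chars of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # drop ca from a
                current[j - 1] + 1,            # add cb to a
                previous[j - 1] + (ca != cb),  # substitute, or match at no cost
            ))
        previous = current
    return previous[-1]


def cer(ocr_text: str, gt_text: str) -> float:
    """Character error rate: edit distance divided by ground-truth length."""
    if not gt_text:
        raise ValueError("ground truth must not be empty")
    return levenshtein(ocr_text, gt_text) / len(gt_text)
```

Running `cer` on the text of each workflow against the same ground truth would show whether the differences are spread evenly or concentrated in a few lines, which would in turn point to segmentation rather than recognition. Note that this plain CER is sensitive to line order, so a segmentation that reorders lines will score badly even if every line is recognized correctly.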