Potential bug: output of Tesseract (C-API) and Tesseract (sh) is different
Created by: mathiasimmer
I got this simple example:
from PIL import Image from pyocr import pyocr py_img = Image.open('text.png') for tool in pyocr.get_available_tools(): print("Using pyocr tool '%s'" % (tool.get_name())) print(tool.image_to_string(py_img))
As this is a fairly simple case I would have expected the outcome to be the same however the outcome is:
Using pyocr tool 'Tesseract (C-API)' Empty page!! Using pyocr tool 'Tesseract (sh)' 3/2
Is this a bug or are the two tools configured differently by default? I know the Tesseract (C-API) works properly on my computer as I have used it successfully with similar but different input, however in this very particular case, it fails.