Potential bug: output of Tesseract (C-API) and Tesseract (sh) is different
Created by: mathiasimmer
I got this simple example:
from PIL import Image
from pyocr import pyocr
py_img = Image.open('text.png')
for tool in pyocr.get_available_tools():
print("Using pyocr tool '%s'" % (tool.get_name()))
print(tool.image_to_string(py_img))
As this is a fairly simple case I would have expected the outcome to be the same however the outcome is:
Using pyocr tool 'Tesseract (C-API)'
Empty page!!
Using pyocr tool 'Tesseract (sh)'
3/2
Is this a bug or are the two tools configured differently by default? I know the Tesseract (C-API) works properly on my computer as I have used it successfully with similar but different input, however in this very particular case, it fails.