Ignore boxes at (0, 0)
For some reason, Tesseract sometimes returns boxes (word or line) that occupy the whole image. This causes problems in applications like Paperwork.
Pyocr should drop those boxes (or give them a size of 0x0, so their content can still be indexed?).
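
A rough sketch of both options, to make the proposal concrete. This is not Pyocr's actual code: `Box` here is a hypothetical stand-in that assumes a box exposes `content` and a `position` of the form `((x1, y1), (x2, y2))`, and the helper names and the `image_size` argument (width, height in pixels) are made up for illustration.

```python
from collections import namedtuple

# Hypothetical stand-in for a box: position is ((x1, y1), (x2, y2)).
Box = namedtuple("Box", ["content", "position"])


def covers_whole_image(box, image_size):
    """Return True if the box spans the full image (assumed (width, height))."""
    (x1, y1), (x2, y2) = box.position
    width, height = image_size
    return x1 <= 0 and y1 <= 0 and x2 >= width and y2 >= height


def drop_full_image_boxes(boxes, image_size):
    """Option 1: discard boxes that occupy the whole image."""
    return [b for b in boxes if not covers_whole_image(b, image_size)]


def collapse_full_image_boxes(boxes, image_size):
    """Option 2: keep the content but shrink the box to 0x0 at (0, 0)."""
    return [
        b._replace(position=((0, 0), (0, 0)))
        if covers_whole_image(b, image_size)
        else b
        for b in boxes
    ]
```

Option 2 keeps the text available for indexing, at the cost of every caller having to cope with zero-sized boxes.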