Tesseract-processed PDFs (OCR-processed, or PDFs made searchable with tesseract) show squares/rectangles/boxes when you select text :(
I processed this PDF with tesseract using pdf2searchablepdf mypdf
. (Get pdf2searchablepdf
, a light-weight tesseract wrapper written in bash, here).
Now, when I select text in ubuntu 18.04's Document Viewer (Evince), it just shows boxes!:
In Foxit Reader, however, it looks fine when I select text:
The boxes are a bug. It should show selected text like Foxit Reader does instead.
Related: https://github.com/ElectricRCAircraftGuy/PDF2SearchablePDF/issues/8
Edited by Gabriel Staples