ABBYY OCR support?
Created by: The-Compiler
So I tried using paperwork and was not really satisfied with the results, it looks like Tesseract works as bad with my documents as it did some years ago when I last tried...
I found ABBYY OCR for Linux to work much better (at least for my documents), but I found the tooling around it to be lacking, so I didn't buy it so far (but played with the trial).
What do you think about integration of that into PyOCR? It seems to have an XML export with character box information, so I think that should work.
If you agree with the idea, I might contribute one day - but I'm currently very busy with my own projects, so that'll probably take a few months.