Commit e49d9188 authored by Jerome Flesch

README: details installation instructions

Signed-off-by: Jerome Flesch <jflesch@openpaper.work>
parent 7e33d79a
@@ -19,6 +19,7 @@ bmp, tiff, and others. It also support bounding box data.
* Tesseract (fork + exec)
* Cuneiform (fork + exec)
## Features
* Supports all the image formats supported by [Pillow](https://github.com/python-imaging/Pillow)
@@ -26,10 +27,28 @@ bmp, tiff, and others. It also support bounding box data.
* Can focus on digits only (Tesseract only)
* Can save and reload boxes in hOCR format
## Limitations
* hOCR: Only a subset of the specification is supported. For instance, pages and paragraph positions are not stored.
## Installation
```sh
$ sudo pip install pyocr # Python 2.7
$ sudo pip3 install pyocr # Python 3
```
or the manual way:
```sh
$ mkdir -p ~/git ; cd ~/git
$ git clone https://github.com/jflesch/pyocr.git
$ cd pyocr
$ sudo python ./setup.py install
```
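After either installation method, a quick check can confirm that the module is importable and that at least one OCR tool is detected. The following is a minimal sketch, assuming Python 3; the helper name `ocr_tools_report` is hypothetical and not part of PyOCR.

```python
import importlib.util


def ocr_tools_report():
    """Return human-readable lines describing the PyOCR installation.

    Hypothetical helper for illustration; not part of PyOCR itself.
    """
    # Check for the module without importing it, so a missing install
    # is reported instead of raising ImportError.
    if importlib.util.find_spec("pyocr") is None:
        return ["pyocr is not importable; the installation did not succeed"]

    import pyocr

    # get_available_tools() lists the OCR backends PyOCR managed to find.
    tools = pyocr.get_available_tools()
    if not tools:
        return ["pyocr is installed, but no OCR tool (Tesseract, Cuneiform) was found"]
    return ["found OCR tool: " + tool.get_name() for tool in tools]


for line in ocr_tools_report():
    print(line)
```

If the report says no OCR tool was found, the Python package itself is installed; what is missing is Tesseract or Cuneiform, which must be installed separately.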
## Usage
### Initialization
@@ -139,11 +158,6 @@ detected in the image.
* or cuneiform
## Installation
$ sudo python ./setup.py install
## Tests
$ python ./run_tests.py