Given the images it will automatically outline its contents, distinguish between what's graphics and text and perform OCR over the latter. It generates multiple formats being its main one ODT.
It features a complete GTK+ GUI that allows users to correct any unrecognized characters, define or correct bounding boxes, set paragraph styles, clean the input images, import PDFs, save and load the project, export everything to multiple formats, etc.In this version
===========
* Detect Tesseract, Ocrad and GOCR from the system when no engines are configured
* Simplified Chinese translation, thanks to 刘洪德 (Hongde Liu)
* Add verifications to prevent failure when engines' files don't exist
* Replace usage of minidom by ElementTree in the feeder module
* Add compilemessages and generatepot command to Makefile.
* Correct Philippe Normand's name
* Correct boxes allowed limits when dragging
* Lower the images' border to 5 pixels