Re: [orca-list] Anyone able to OCR a PDF file?



Am 04.01.2012 02:13, schrieb Jason White:
Janina Sajka<janina rednote net>  wrote:

I know people do this on other OS's. Has anyone suggestions on how to do
this in Linux?
i

Cool I've found a solution for this two weeks ago :-).
Don't use convert from imagemagick.
use pdftoppm from poppler-utils instead.

1. pdftoppm your.pdf output
2. for i in *.ppm ; do
cuneiform $i -o $i.txt -l lang_name
done

HTH.
Halim




[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]