Re: [orca-list] PDF reader



Leena Chourey <leenagour gmail com> wrote:
 
You can try "pdftohtml -s <filename>"   (in poppler-0.15.1) utility. This
will generate <filename-html> file that can be opened in web browser. 

Or you can use pdftotext to create a text file from the PDF document. I
usually start with
pdftotext -layout -enc ASCII7 filename
but if the document has multiple columns or other complex layout then I use
-raw rather than -layout.

The ASCII7 encoding is great for English text, but not for other languages; I
use it on English-language documents to convert ligatures and kerning (often
represented as special characters in the font) into their underlying letters.




[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]