Re: [orca-list] converting pdf to text



Hello list,
Yes, its useful and in the similar way you can use pdftohtml for simple and complex output. All these are utility functions provided by poppler library and xpdf, evince all uses this library to render pdf.
"pdftohtml -c <filename>" is for complex output that generates page wise formatted (fonts, size, 2-column, images etc) html page for each page of pdf file.
Accessibility team at CDAC mumbai has included another option "pdftohtml -s <filename>" that makes a pdf file more accessible and usable. It produce single html file for corresponding pdf file, so that user need not to open each page separately.

With regards
Leena C
For accessibility team @ CDAC Mumbai

On Wed, Sep 8, 2010 at 8:16 AM, Jonathan Nadeau <j nadeau charter net> wrote:
Hello list,

I just wanted to let everyone know about this if you didn't know
already. If you have xpdf installed you can convert pdf to text really
easy. This is how I do it. First I have the pdf in my home folder then I
go to the terminal and type this

pdftotext filename

then hit enter and when it is done you will find the same named pdf as a
text file in your home folder it works great.

_______________________________________________
orca-list mailing list
orca-list gnome org
http://mail.gnome.org/mailman/listinfo/orca-list
Visit http://live.gnome.org/Orca for more information on Orca.
The manual is at http://library.gnome.org/users/gnome-access-guide/nightly/ats-2.html
The FAQ is at http://live.gnome.org/Orca/FrequentlyAskedQuestions
Netiquette Guidelines are at http://live.gnome.org/Orca/FrequentlyAskedQuestions/NetiquetteGuidelines
Log bugs and feature requests at http://bugzilla.gnome.org
Find out how to help at http://live.gnome.org/Orca/HowCanIHelp



[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]