Would it be possible for you to share this shell script?
“ Never bend your head. Always hold it high. Look the world straight in the eye.” ~ Helen Keller On Apr 16, 2021, at 5:46 AM, Rynhardt Kruger via orca-list <orca-list gnome org> wrote:
I usually get the best results with Chromium's built-in PDF reader. It places each page in a landmarked region, allowing you to quickly jump between pages with m and shift+m. It also supports at least some PDF accessability tags.
For broken PDFs requiring some cleanup I use pdftotext from poppler to convert them to text. If the PDF is really badly broken, or if it's an image PDF, I use pdftocairo to convert each page to a PNG image, and Tesseract to OCR the images (I have a small shell script to automate this process).
Regards,
Rynhardt
_______________________________________________orca-list mailing listorca-list gnome orghttps://mail.gnome.org/mailman/listinfo/orca-listOrca wiki: https://wiki.gnome.org/Projects/OrcaOrca documentation: https://help.gnome.org/users/orca/stable/GNOME Universal Access guide: https://help.gnome.org/users/gnome-help/stable/a11y.html
|