Re: Will Beagle index PDFs?
- From: Matt Jones <mattharrison sbcglobal net>
- To: albertvilella terra es
- Cc: dashboard-hackers gnome org
- Subject: Re: Will Beagle index PDFs?
- Date: Wed, 21 Jul 2004 12:33:22 -0700
Hi -
> I think that files that have been "scanned" and converted to PDF without
> being "OCRed" are like big images inserted in pdf headers, so I don't
> know how is that indexed in Lucene.
>
> Anyone?
>
Some of the OCR software I've used allows you to save the result as a
pdf to preserve most of the visual elements (like graphics or
watermarks), but still converts the text elements to text.
-- Matt
[
Date Prev][
Date Next] [
Thread Prev][
Thread Next]
[
Thread Index]
[
Date Index]
[
Author Index]