Re: [Tracker] performance problems in the pdf extractor?



On 10/02/10 14:55, Carlos Garnacho wrote:
Hi!,
Ideally (at least for my case), there should be a quick has_text()
function in poppler, so we don't make it uselessly go through all
streams. But this boils down to a more general problem: while extractors
should give up after some time, there should be a more elegant way to
tell the miner so than a DBus timeout. I plan to work on this soon.

I wonder if the patch on this bug could be used to improve the PDF situation?

  https://bugzilla.gnome.org/show_bug.cgi?id=609075

Also, PDF has always been slow to extract.
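
To make the "give up after some time" idea from the quoted mail concrete,
here is a rough sketch only. It is not Tracker or poppler code: the
function name, the per-page callables and the injectable clock are all
made up for illustration. The point is that the extractor checks its own
time budget between pages and reports "incomplete" itself, instead of
being cut off by a DBus timeout.

```python
import time
from typing import Callable, Iterable, List, Tuple


def extract_with_deadline(
    pages: Iterable[Callable[[], str]],
    budget_seconds: float,
    clock: Callable[[], float] = time.monotonic,
) -> Tuple[str, bool]:
    """Hypothetical helper: extract text page by page within a time budget.

    Each item in `pages` is a callable returning that page's text.
    Returns (text, complete); `complete` is False if the budget ran out
    before every page was processed.
    """
    deadline = clock() + budget_seconds
    chunks: List[str] = []
    for extract_page in pages:
        # Check the budget before starting each page, so the caller gets
        # the partial text plus an explicit "gave up" flag.
        if clock() >= deadline:
            return "\n".join(chunks), False
        chunks.append(extract_page())
    return "\n".join(chunks), True
```

The budget is only checked between pages, so a single very slow page can
still overrun it. A caller could index the partial text and record that
the file was not fully extracted.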

--
Regards,
Martyn


