Re: [Tracker] performance problems in the pdf extractor?
- From: Martyn Russell <martyn lanedo com>
- To: Carlos Garnacho <carlos lanedo com>
- Cc: Tracker-List <tracker-list gnome org>
- Subject: Re: [Tracker] performance problems in the pdf extractor?
- Date: Wed, 10 Feb 2010 15:32:53 +0100
On 10/02/10 14:55, Carlos Garnacho wrote:
Hi!,
Ideally (at least for my case), there should be a quick has_text()
function in poppler, so we don't make it uselessly go through all
streams. But this boils down to a more general problem, while extractors
should give up after some time, there should be a more elegant way to
tell the miner so than a DBus timeout, I plan to work on this soon.
I wonder if the patch on this bug could be used to improve the PDF
situation?
https://bugzilla.gnome.org/show_bug.cgi?id=609075
Also PDF has always been slow to extract.
--
Regards,
Martyn
[
Date Prev][
Date Next] [
Thread Prev][
Thread Next]
[
Thread Index]
[
Date Index]
[
Author Index]