Re: [Tracker] not indexing text from PDF files



On 10/31/2013 12:26 PM, Ivan Frade wrote:
> Hi,

Hi,

> 1. Tracker removes "stop words", i.e. very common words such as "the" or
> "almost". Are you searching for one of those? There is a separate list
> for each language; you can find them in /usr/share/tracker/languages/

Yeah, not looking for stop words.

> 2. Tracker indexes only the first 10000 words of a PDF. Is the first
> occurrence of the word you are searching for beyond that limit?

Nope.  The document doesn't have anywhere near 10000 words.  In any case
the word I am searching for is near the beginning.
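
For anyone wanting to rule out both points mechanically once they have the text of the PDF in hand, here is a minimal Python sketch. It assumes the extracted text is already in a string. The helper names and the simple \w+ tokenizer are my own illustration and will not match Tracker's real word breaking exactly; the default of 10000 is just the limit quoted above.

```python
import re


def first_word_index(text, word):
    """Return the 1-based position of the first occurrence of `word`, or None.

    Words are split with a plain \\w+ regex, which is only an approximation
    of how an indexer would tokenize the text.
    """
    words = re.findall(r"\w+", text.lower())
    try:
        return words.index(word.lower()) + 1
    except ValueError:
        return None


def within_index_limit(text, word, limit=10000):
    """True if `word` first appears within the first `limit` words."""
    position = first_word_index(text, word)
    return position is not None and position <= limit


def is_stop_word(word, stop_words):
    """Case-insensitive membership test against a list of stop words."""
    return word.lower() in {w.lower() for w in stop_words}
```

The stop word list would have to be read from one of the files under /usr/share/tracker/languages/; I am not assuming anything about their exact format here.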

Maybe it would help to manually run the command that tracker uses to
extract the text from a PDF to see if that's working.

Any idea how one can do that?

Cheers,
b.




