Re: [Tracker] not indexing text from PDF files



On 10/31/2013 01:41 PM, Aleksander Morgado wrote:
/usr/libexec/tracker-extract -f /path/to/file.pdf

Cool.  So, that on a the given file I see:

SPARQL item:
--
 a nfo:PaginatedTextDocument ;
         nie:title "NONE" ;
         nie:subject "NONE" ;
         nco:creator [ a nco:Contact ;
         nco:fullname "NONE"] ;
         nao:hasTag ?tag1 ;
         nfo:pageCount 1 ;
         nie:plainTextContent
"list\nof\nwords\nseparated\nby\ncarriage-returns\n'' " .

Searching for any of those words in the plainTextContent item fails.

Cheers,
b.


Attachment: signature.asc
Description: OpenPGP digital signature



[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]