Re: [Tracker] tracker as full text index/search tool for a large collection of pdf, ps, djvu, dvi documents?



2008/10/4 Meik Hellmund <Meik Hellmund math uni-leipzig de>:

Dear tracker developers,

I have a collection of ca 10000 documents, mostly Postscript, PDF,
DjVu and DVI format and I am looking for a full text index/search
tool. I tried tracker 0.6.6 from Debian/unstable and have now some
questions where I didn't find the answer in the docu and faq.

 - Tracker works fine and great with PDF documents. Full points!
  That's what I am looking for.
  But:

 - It seems that Postscript, Dvi and Djvu documents are not fully
  indexed, only the metadata are used. How can I change this?

For djvu, there is already a a filter
/usr/lib/tracker/filters/text/djvu_filter

It should index the content of djvu files, but it requires the
djvulibre-bin package being installed. (The tracker deb package has a
recommends on this package).

Ivan already posted instruction how to create filters for ps and dvi.

If you have created working filters for these mimetypes feel free to
send them to us so we can include them upstream.

Cheers,
Michael

-- 
Why is it that all of the instruments seeking intelligent life in the
universe are pointed away from Earth?



[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]