Re: [Tracker] pdf indexing problem



Hi, I updated to the new 0.10.1, but it seems worse for me. Now it does not extract any text it seems (see below). I did install the new poppler, but I dont know if it needs any specific configurations when installing.

cheers,
bjorn


bjorn bjorn-laptop:~$ /usr/libexec/tracker-extract -v 3 -f /home/bjorn/Dropbox/storage/BK792KTA/goldstein.pdf
Initializing tracker-extract...
Tracker-Message: Setting up monitor for changes to config file:'/home/bjorn/.config/tracker/tracker-extract.cfg'
Tracker-Message: Loading defaults into GKeyFile...
Initializing Storage...
Mount monitors set up for to watch for added, removed and pre-unmounts...
No mounts found to iterate
Setting process priority
Could not load module 'libextract-jpeg.so': /usr/lib/tracker-0.10/extract-modules/libextract-jpeg.so: undefined symbol: iptc_jpeg_ps3_find_iptc
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-msoffice-xml.so' with:
  Specific match for mime:'application/vnd.openxmlformats-officedocument.presentationml.presentation'
  Specific match for mime:'application/vnd.openxmlformats-officedocument.presentationml.slideshow'
  Specific match for mime:'application/vnd.openxmlformats-officedocument.spreadsheetml.sheet'
  Specific match for mime:'application/vnd.openxmlformats-officedocument.wordprocessingml.document'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-flac.so' with:
  Specific match for mime:'audio/x-flac'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-oasis.so' with:
  Generic  match for mime:'application/vnd.oasis.opendocument.*'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-mp3.so' with:
  Specific match for mime:'audio/mpeg'
  Specific match for mime:'audio/x-mp3'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-gstreamer.so' with:
  Generic  match for mime:'audio/*'
  Generic  match for mime:'video/*'
  Generic  match for mime:'image/*'
  Specific match for mime:'image/svg+xml'
  Specific match for mime:'video/3gpp'
  Specific match for mime:'video/mp4'
  Specific match for mime:'video/x-ms-asf'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-gstreamer-helix.so' with:
  Specific match for mime:'audio/vnd.rn-realaudio'
  Specific match for mime:'audio/x-pn-realaudio'
  Specific match for mime:'audio/x-pn-realaudio-plugin'
  Specific match for mime:'video/vnd.rn-realvideo'
  Specific match for mime:'video/x-pn-realvideo'
  Specific match for mime:'application/vnd.rn-realmedia'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-vorbis.so' with:
  Specific match for mime:'audio/x-vorbis+ogg'
  Specific match for mime:'application/ogg'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-xmp.so' with:
  Specific match for mime:'application/rdf+xml'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-icon.so' with:
  Specific match for mime:'image/vnd.microsoft.icon'
Could not load module 'libextract-pdf.so': libpoppler-glib.so.6: cannot open shared object file: No such file or directory
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-text.so' with:
  Generic  match for mime:'text/*'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-mplayer.so' with:
  Generic  match for mime:'audio/*'
  Generic  match for mime:'video/*'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-ps.so' with:
  Specific match for mime:'application/x-gzpostscript'
  Specific match for mime:'application/postscript'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-tiff.so' with:
  Specific match for mime:'image/tiff'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-playlist.so' with:
  Specific match for mime:'audio/x-mpegurl'
  Specific match for mime:'audio/mpegurl'
  Specific match for mime:'audio/x-scpls'
  Specific match for mime:'audio/x-pn-realaudio'
  Specific match for mime:'application/ram'
  Specific match for mime:'application/vnd.ms-wpl'
  Specific match for mime:'application/smil'
  Specific match for mime:'audio/x-ms-asx'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-abw.so' with:
  Specific match for mime:'application/x-abiword'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-html.so' with:
  Specific match for mime:'text/html'
  Specific match for mime:'application/xhtml+xml'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-png.so' with:
  Specific match for mime:'image/png'
  Specific match for mime:'sketch/png'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-totem.so' with:
  Generic  match for mime:'audio/*'
  Generic  match for mime:'video/*'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-gif.so' with:
  Specific match for mime:'image/gif'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-msoffice.so' with:
  Specific match for mime:'application/msword'
  Specific match for mime:'application/vnd.ms-powerpoint'
  Specific match for mime:'application/vnd.ms-excel'
  Generic  match for mime:'application/vnd.ms-*'
<--- [1|0] tracker_extract_get_metadata_by_cmdline(uri:'file:///home/bjorn/Dropbox/storage/BK792KTA/goldstein.pdf', mime:(null))
---- [1|0]   Guessing mime type as 'application/pdf' for uri:'file:///home/bjorn/Dropbox/storage/BK792KTA/goldstein.pdf'
---- [1|0]   Could not find any extractors to handle metadata type (mime: application/pdf)
---- [1|0]
---- [1|0]
---> [1|0] Success, no error given
bjorn bjorn-laptop:~$




On Mon, Mar 7, 2011 at 11:42, Aleksander Morgado <aleksander lanedo com> wrote:

> I used the following script to configure. I don know from this whether
> FTS is enabled?
> probably I will reinstall when the ppa has the 0.10 version.
>
> The text I want to search for is inside the pdf. tracker-extract spits
> out all the text from the pdf and I can find my search term in this
> text. Tracker-needle finds nothing though.
>
> /bjorn
>
> # !/bin/bash
>
> EXTRA_ARG=$1
>
> ./configure \
>     --prefix=/usr --sysconfdir=/etc --localstatedir=/var \
>     --enable-libstreamanalyzer=no \
>     --enable-unit-tests=yes \
>     --enable-maemo=yes \
>     --enable-gstreamer-tagreadbin=yes \
>     --enable-gdkpixbuf=yes \
>     --enable-poppler=yes \
>     --enable-video-extractor=gstreamer \
>     --enable-gstreamer-helix=yes \
>     --enable-libgsf=yes \
>     --enable-gnome-keyring=yes \
>     --enable-miner-evolution=no \
>     --enable-miner-rss=no \
>     --enable-miner-flickr=no \
>     --enable-tracker-explorer=yes \
>     --enable-tracker-needle=yes \
>     --enable-tracker-preferences=yes \
>     --enable-libexif=yes \
>     --enable-libiptcdata=yes \
>     --enable-libjpeg=yes \
>     --enable-libgif=yes \
>     --enable-libtiff=yes \
>     --enable-libvorbis=yes \
>     --enable-libflac=yes \
>     --enable-exempi=yes \
>     --enable-taglib=yes \
>     --enable-playlist=yes \
>     --enable-nautilus-extension=yes \
>     --enable-functional-tests=yes \
>     --enable-network-manager=yes \
>     $EXTRA_ARG


Please also make sure you use an explicitly stated unicode support
library, either
  --with-unicode-support=libunistring
or
  --with-unicode-support=libicu


--
Aleksander




--
______O_________oO________oO______o_______oO__
Björn Johansson
Assistant Professor
Departament of Biology
University of Minho
Campus de Gualtar
4710-057 Braga
PORTUGAL
http://www.bio.uminho.pt
http://sites.google.com/site/bjornhome
Work (direct) +351-253 601517
Private mob. +351-967 147 704
Dept of Biology (secretariate) +351-253 60 4310
Dept of Biology (fax) +351-253 678980


[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]