Re: [Tracker] pdf indexing problem
- From: Björn Johansson <bjorn_johansson bio uminho pt>
- To: tracker-list <tracker-list gnome org>
- Subject: Re: [Tracker] pdf indexing problem
- Date: Mon, 7 Mar 2011 17:59:20 +0000
Hi, I updated to the new 0.10.1, but it seems worse for me. Now it does not extract any text it seems (see below). I did install the new poppler, but I dont know if it needs any specific configurations when installing.
cheers,
bjorn
bjorn bjorn-laptop:~$ /usr/libexec/tracker-extract -v 3 -f /home/bjorn/Dropbox/storage/BK792KTA/goldstein.pdf
Initializing tracker-extract...
Tracker-Message: Setting up monitor for changes to config file:'/home/bjorn/.config/tracker/tracker-extract.cfg'
Tracker-Message: Loading defaults into GKeyFile...
Initializing Storage...
Mount monitors set up for to watch for added, removed and pre-unmounts...
No mounts found to iterate
Setting process priority
Could not load module 'libextract-jpeg.so': /usr/lib/tracker-0.10/extract-modules/libextract-jpeg.so: undefined symbol: iptc_jpeg_ps3_find_iptc
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-msoffice-xml.so' with:
Specific match for mime:'application/vnd.openxmlformats-officedocument.presentationml.presentation'
Specific match for mime:'application/vnd.openxmlformats-officedocument.presentationml.slideshow'
Specific match for mime:'application/vnd.openxmlformats-officedocument.spreadsheetml.sheet'
Specific match for mime:'application/vnd.openxmlformats-officedocument.wordprocessingml.document'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-flac.so' with:
Specific match for mime:'audio/x-flac'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-oasis.so' with:
Generic match for mime:'application/vnd.oasis.opendocument.*'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-mp3.so' with:
Specific match for mime:'audio/mpeg'
Specific match for mime:'audio/x-mp3'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-gstreamer.so' with:
Generic match for mime:'audio/*'
Generic match for mime:'video/*'
Generic match for mime:'image/*'
Specific match for mime:'image/svg+xml'
Specific match for mime:'video/3gpp'
Specific match for mime:'video/mp4'
Specific match for mime:'video/x-ms-asf'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-gstreamer-helix.so' with:
Specific match for mime:'audio/vnd.rn-realaudio'
Specific match for mime:'audio/x-pn-realaudio'
Specific match for mime:'audio/x-pn-realaudio-plugin'
Specific match for mime:'video/vnd.rn-realvideo'
Specific match for mime:'video/x-pn-realvideo'
Specific match for mime:'application/vnd.rn-realmedia'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-vorbis.so' with:
Specific match for mime:'audio/x-vorbis+ogg'
Specific match for mime:'application/ogg'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-xmp.so' with:
Specific match for mime:'application/rdf+xml'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-icon.so' with:
Specific match for mime:'image/vnd.microsoft.icon'
Could not load module 'libextract-pdf.so': libpoppler-glib.so.6: cannot open shared object file: No such file or directory
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-text.so' with:
Generic match for mime:'text/*'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-mplayer.so' with:
Generic match for mime:'audio/*'
Generic match for mime:'video/*'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-ps.so' with:
Specific match for mime:'application/x-gzpostscript'
Specific match for mime:'application/postscript'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-tiff.so' with:
Specific match for mime:'image/tiff'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-playlist.so' with:
Specific match for mime:'audio/x-mpegurl'
Specific match for mime:'audio/mpegurl'
Specific match for mime:'audio/x-scpls'
Specific match for mime:'audio/x-pn-realaudio'
Specific match for mime:'application/ram'
Specific match for mime:'application/vnd.ms-wpl'
Specific match for mime:'application/smil'
Specific match for mime:'audio/x-ms-asx'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-abw.so' with:
Specific match for mime:'application/x-abiword'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-html.so' with:
Specific match for mime:'text/html'
Specific match for mime:'application/xhtml+xml'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-png.so' with:
Specific match for mime:'image/png'
Specific match for mime:'sketch/png'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-totem.so' with:
Generic match for mime:'audio/*'
Generic match for mime:'video/*'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-gif.so' with:
Specific match for mime:'image/gif'
Adding extractor:'/usr/lib/tracker-0.10/extract-modules/libextract-msoffice.so' with:
Specific match for mime:'application/msword'
Specific match for mime:'application/vnd.ms-powerpoint'
Specific match for mime:'application/vnd.ms-excel'
Generic match for mime:'application/vnd.ms-*'
<--- [1|0] tracker_extract_get_metadata_by_cmdline(uri:'file:///home/bjorn/Dropbox/storage/BK792KTA/goldstein.pdf', mime:(null))
---- [1|0] Guessing mime type as 'application/pdf' for uri:'file:///home/bjorn/Dropbox/storage/BK792KTA/goldstein.pdf'
---- [1|0] Could not find any extractors to handle metadata type (mime: application/pdf)
---- [1|0]
---- [1|0]
---> [1|0] Success, no error given
bjorn bjorn-laptop:~$
On Mon, Mar 7, 2011 at 11:42, Aleksander Morgado
<aleksander lanedo com> wrote:
> I used the following script to configure. I don know from this whether
> FTS is enabled?
> probably I will reinstall when the ppa has the 0.10 version.
>
> The text I want to search for is inside the pdf. tracker-extract spits
> out all the text from the pdf and I can find my search term in this
> text. Tracker-needle finds nothing though.
>
> /bjorn
>
> # !/bin/bash
>
> EXTRA_ARG=$1
>
> ./configure \
> --prefix=/usr --sysconfdir=/etc --localstatedir=/var \
> --enable-libstreamanalyzer=no \
> --enable-unit-tests=yes \
> --enable-maemo=yes \
> --enable-gstreamer-tagreadbin=yes \
> --enable-gdkpixbuf=yes \
> --enable-poppler=yes \
> --enable-video-extractor=gstreamer \
> --enable-gstreamer-helix=yes \
> --enable-libgsf=yes \
> --enable-gnome-keyring=yes \
> --enable-miner-evolution=no \
> --enable-miner-rss=no \
> --enable-miner-flickr=no \
> --enable-tracker-explorer=yes \
> --enable-tracker-needle=yes \
> --enable-tracker-preferences=yes \
> --enable-libexif=yes \
> --enable-libiptcdata=yes \
> --enable-libjpeg=yes \
> --enable-libgif=yes \
> --enable-libtiff=yes \
> --enable-libvorbis=yes \
> --enable-libflac=yes \
> --enable-exempi=yes \
> --enable-taglib=yes \
> --enable-playlist=yes \
> --enable-nautilus-extension=yes \
> --enable-functional-tests=yes \
> --enable-network-manager=yes \
> $EXTRA_ARG
Please also make sure you use an explicitly stated unicode support
library, either
--with-unicode-support=libunistring
or
--with-unicode-support=libicu
--
Aleksander
--
______O_________oO________oO______o_______oO__ Björn Johansson
Assistant ProfessorDepartament of Biology
University of MinhoCampus de Gualtar
4710-057 BragaPORTUGAL
http://www.bio.uminho.pthttp://sites.google.com/site/bjornhome
Work (direct) +351-253 601517Private mob. +351-967 147 704
Dept of Biology (secretariate) +351-253 60 4310Dept of Biology (fax) +351-253 678980
[
Date Prev][
Date Next] [
Thread Prev][
Thread Next]
[
Thread Index]
[
Date Index]
[
Author Index]