Re: [Tracker] cvs is a bit hard to test.

From: Anders Aagaard <aagaande gmail com>
To: Jamie McCracken <jamiemcc blueyonder co uk>, tracker-list gnome org
Subject: Re: [Tracker] cvs is a bit hard to test.
Date: Sun, 13 Aug 2006 14:52:07 +0000

Jamie McCracken wrote:

Anders Aagaard wrote:
In the meantime can you use the libextractor that comes with gentoo. During compile, if tracker sees that libextractor is already installed then it won't build the inlined version by default.
heh, yeah, that's the problem, that one doesn't work either ;)
please report the bugs to them then.


yeah it's been done.

Nice, but does that currently support full text searches? One of the things I'd absolutely love to do with tracker is to search using something like "type:video funny", to find funny videos (type video can use mime types to find all the extensions). Or even "match:*.avi funny".
should be possible in the near future. for now you would have to use rdfquery to do that.


very nice

All I found on the subject was this:
http://www.sqlite.org/cvstrac/wiki/wiki?p=FullTextIndex
We will be using qdbm to create an inverted word index.
This is just a file-based hash table that takes a word and returns DocIds and associated scores, which we can stick in a temporary table in sqlite for further processing.
I'm confident we will get a faster and more lightweight tracker by doing this, along with bonus features like stemming and score weighting on different types of metadata (none of which is supported in mysql).
If in the future sqlite can deliver equivalent functionality without sacrificing speed/memory, then we can migrate towards that. But I need a scalable solution today that can handle tons of emails as well as files.


Very nice, thanks for the information :)
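
For readers following the thread, here is a rough sketch of the idea described above: a word -> (DocId, score) lookup whose results are loaded into a temporary sqlite table and joined against file metadata. It is only an illustration under stated assumptions, not Tracker's actual code. A plain Python dict stands in for the qdbm hash table, the score is a simple word count, and the "files" table, the "hits" table and the function names are all made up for the example.

```python
import sqlite3
from collections import defaultdict


def build_index(docs):
    """Build an inverted index: word -> {doc_id: score}.

    A dict stands in for the qdbm file-based hash table (assumption).
    The score here is just the number of times the word occurs.
    """
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for word in text.lower().split():
            index[word][doc_id] = index[word].get(doc_id, 0) + 1
    return index


def search(conn, index, word):
    """Look up a word, stage the hits in a temp table, join with metadata."""
    hits = index.get(word.lower(), {})
    conn.execute(
        "CREATE TEMP TABLE IF NOT EXISTS hits "
        "(doc_id INTEGER PRIMARY KEY, score INTEGER)"
    )
    conn.execute("DELETE FROM hits")  # clear results of any previous search
    conn.executemany("INSERT INTO hits VALUES (?, ?)", hits.items())
    # Further processing happens in SQL: here, a join and a sort by score.
    return conn.execute(
        "SELECT f.path, h.score FROM hits h "
        "JOIN files f ON f.id = h.doc_id "
        "ORDER BY h.score DESC, f.path"
    ).fetchall()


def demo():
    """Tiny end-to-end run with hypothetical data."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE files (id INTEGER PRIMARY KEY, path TEXT)")
    conn.executemany(
        "INSERT INTO files VALUES (?, ?)",
        [(1, "cats.avi"), (2, "notes.txt"), (3, "mail.eml")],
    )
    docs = {
        1: "funny cat video funny",
        2: "meeting notes",
        3: "a funny email",
    }
    return search(conn, build_index(docs), "funny")


if __name__ == "__main__":
    for path, score in demo():
        print(path, score)
```

Keeping the word lookup outside sqlite and only handing it the matching DocIds means the SQL side just filters and sorts a small result set, which seems to be the point of the split described above. Stemming and per-metadata score weighting are left out of the sketch.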
