Re: Beagle stop words



> I was wondering if there were any stop words (or non-words) that Beagle
> excludes from its index. Are there terms that are not indexed even if a
> filter returns them? Thanks.

I believe its the list given in
http://svn.gnome.org/viewvc/beagle/trunk/beagle/beagled/Lucene.Net/Analysis/StopAnalyzer.cs?view=markup

In addition to it any word containing too many alphabet-number
switches is also considered garbage (e.g. a group of characters from a
binary file).

- dBera

-- 
-----------------------------------------------------
Debajyoti Bera @ http://dtecht.blogspot.com
beagle / KDE fan
Mandriva / Inspiron-1100 user


[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]