Re: Beagle stop words
- From: "D Bera" <dbera web gmail com>
- To: "Andrew Leung" <aleung soe ucsc edu>
- Cc: Beagle-Mailing-List <dashboard-hackers gnome org>
- Subject: Re: Beagle stop words
- Date: Fri, 25 Jul 2008 21:17:05 -0400
> I was wondering if there were any stop words (or non-words) that Beagle
> excludes from its index. Are there terms that are not indexed even if a
> filter returns them? Thanks.
I believe its the list given in
http://svn.gnome.org/viewvc/beagle/trunk/beagle/beagled/Lucene.Net/Analysis/StopAnalyzer.cs?view=markup
In addition to it any word containing too many alphabet-number
switches is also considered garbage (e.g. a group of characters from a
binary file).
- dBera
--
-----------------------------------------------------
Debajyoti Bera @ http://dtecht.blogspot.com
beagle / KDE fan
Mandriva / Inspiron-1100 user
[
Date Prev][
Date Next] [
Thread Prev][
Thread Next]
[
Thread Index]
[
Date Index]
[
Author Index]