Re: MySQL Stop Words

From: Casey Bisson <cbisson_at_nyob>
Date: Mon, 1 Jun 2009 12:15:14 -0400
To: CODE4LIB_at_LISTSERV.ND.EDU
The minimum word length and stop word list are run-time configurable.  
The exclusion of words that are in more than 50% of the corpus is a  
compile-time issue (or simply use boolean). Here are the settings to  
be aware of:

ft_min_word_len=3
ft_stopword_file=/dev/null

--Casey

http://about.scriblio.net/
http://maisonbisson.com/


On Jun 1, 2009, at 11:13 AM, Mike Taylor wrote:

> However, all of these oddities -- over eager stop-list, ignoring short
> words, not counting words in more than half the rows -- can be sorted
> out by configuration options.
Received on Mon Jun 01 2009 - 12:18:47 EDT