That is correct. In fact, any three letter combinations.thedqs wrote:Sounds good though from how Tom made it sound this won't be just a green list of acronyms but all 3 letter words. When will the forums be done?
Tom
I can't remember what version of MySQL you are using but here are some references to updating your stopwords list. Basically you create your own list and override the default stopword list by setting the ft_stopword_file system variable.tomw wrote:Yes it does but I'm not sure where that "common word" list is located.
Tom
A blacklist for short, frequently used words with essentially zero information gain is a good idea. If I understand correctly, most conventional indexing and searching systems use such a list.thedqs wrote:I remember that we talked before about shortening the search to 3 letters but then you have all the problems with "the" "him" "and" which causes a lot of server slowdown. Of course you could add a word blacklist. It is currently in Tom's hands. As for xml I agree that would be nice especially for 3rd part apps to have your own LDS Tech form reader or something. There is an RSS for the new threads created though.
Thanks, I'll look into it.bhofmann wrote:I can't remember what version of MySQL you are using but here are some references to updating your stopwords list. Basically you create your own list and override the default stopword list by setting the ft_stopword_file system variable.
Here is where it talks about it, http://dev.mysql.com/doc/refman/5.0/en/ ... uning.html.
Here is the current stopword list if you wanted a list to to start with, http://dev.mysql.com/doc/refman/5.0/en/ ... words.html.