Lucene/Solr does this automatically. That is how a tf.idf engine works, it boosts rare words.
Do you have examples of problems or are you worrying about something that might happen? wunder On 2/19/07 1:22 AM, "rubdabadub" <[EMAIL PROTECTED]> wrote: > Hi: > > I was wondering how are you guys dealing with "common words"? What I > mean by common words is the ones that fall outside the "stop words" > category. Offcourse "stop words" is subjective i.e. its up to the > implementor. What I would like to do is how do i increase or decrease > boost value based on such "common words". Should I have a field > "Common_Words_Plus" and "Common_Words_Minus"? Plus for words that > needs to be boosted up and minus for the words that gets boosted > down?.. No? > > The above sounds like not so professional -- quick fix.. does any one > have a better solution.. how are you dealing with the above? > > Regards