Lucene/Solr does this automatically. That is how a tf.idf
engine works, it boosts rare words.

Do you have examples of problems or are you worrying about
something that might happen?

wunder

On 2/19/07 1:22 AM, "rubdabadub" <[EMAIL PROTECTED]> wrote:

> Hi:
> 
> I was wondering how are you guys dealing with "common words"? What I
> mean by common words  is the ones that fall outside the "stop words"
> category. Offcourse "stop words" is subjective i.e. its up to the
> implementor. What I would like to do is how do i increase or decrease
> boost value based on such "common words". Should I have a field
> "Common_Words_Plus" and "Common_Words_Minus"? Plus for words that
> needs to be boosted up and minus for the words that gets boosted
> down?.. No?
> 
> The above sounds like not so professional -- quick fix.. does any one
> have a better solution.. how are you dealing with the above?
> 
> Regards

Reply via email to