Hi,

Probably by writing your own Similarity (Lucene codebase) and implementing the 
following method with capping:

  /** Implemented as <code>sqrt(freq)</code>. */
  public float tf(float freq) {
    return (float)Math.sqrt(freq);
  }

Then put that custom Similarity in a jar in Solr's lib and specify your 
Similarity FQCN at the bottom of solrconfig.xml

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

----- Original Message ----
From: peter360 <[EMAIL PROTECTED]>
To: solr-user@lucene.apache.org
Sent: Friday, April 11, 2008 2:16:53 PM
Subject: capping term frequency?


Hi,
How do I cap the term frequency when computing relevancy scores in solr?

The problem is if a keyword repeats many times in the same document, I don't
want it to hijack the relevancy score.  Can I tell solr to cap the term
frequency at a certain threshold?

thanks.
-- 
View this message in context: 
http://www.nabble.com/capping-term-frequency--tp16628189p16628189.html
Sent from the Solr - User mailing list archive at Nabble.com.




Reply via email to