This is an old post; there is now a solution in Solr:
omitTermFreqAndPositions="true"
http://wiki.apache.org/solr/SchemaXml#Data_Types
--
View this message in context:
http://lucene.472066.n3.nabble.com/Disabling-tf-term-frequency-during-indexing-and-or-scoring-tp502956p4062595.html
Sent: Friday, September 18, 2009 8:33 AM
To: solr-user@lucene.apache.org
Subject: Re: Disabling tf (term frequency) during indexing and/or scoring
Hi Yonik,
For my particular needs, IDF considerations are fine and helpful; if a
user is requesting a rare term/phrase, increasing the score based on
that makes sense as the match has higher confidence. I simply need to
compensate for title and category type fields that may contain redundant
On Fri, Sep 18, 2009 at 11:05 AM, Aaron McKee wrote:
> I wonder, though, if it could also make sense to support a
> query-time only boolean to optionally disable TF independently, on a
> per-field basis?
I guess it could make sense. But do you still want idf too? length
norm? or do you really wa
Hi Yonik,
Thank you for the explanation. If the primary goal was to save index
space for a very specific subclass of fields, the implementation
certainly makes more sense. I wonder, though, if it could also make
sense to support a query-time only boolean to optionally disable TF
independentl
Though it would be possible to calculate a binary tf, where the score
is 1 if there are one or more occurrences of the term. --wunder
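To make the binary tf idea concrete, here is a minimal standalone sketch. It does not depend on Lucene: the class and method names are hypothetical, and sqrtTf only assumes the square-root damping that Lucene's DefaultSimilarity applies to raw term frequency.

```java
// Standalone sketch; BinaryTfSketch and its methods are hypothetical names,
// not part of Lucene or Solr.
final class BinaryTfSketch {

    // Assumed to mirror the default behaviour: tf grows with sqrt(freq).
    static float sqrtTf(float freq) {
        return (float) Math.sqrt(freq);
    }

    // Binary tf: 1 if the term occurs at least once, 0 otherwise.
    static float binaryTf(float freq) {
        return freq > 0f ? 1f : 0f;
    }

    public static void main(String[] args) {
        // Compare the two factors for a few raw term frequencies.
        for (float freq : new float[] {0f, 1f, 4f, 9f}) {
            System.out.printf("freq=%.0f sqrtTf=%.1f binaryTf=%.1f%n",
                    freq, sqrtTf(freq), binaryTf(freq));
        }
    }
}
```

With the binary variant, a field that repeats a term scores no higher on tf than a field that mentions it once, while IDF is left untouched.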
On Sep 18, 2009, at 7:08 AM, Yonik Seeley wrote:
On Fri, Sep 18, 2009 at 9:38 AM, Aaron McKee wrote:
I suppose I'm curious why the omitTfAndPositions option
On Fri, Sep 18, 2009 at 9:38 AM, Aaron McKee wrote:
> I suppose I'm curious why the omitTfAndPositions option conflates two
> apparently independent features.
This relates to the index format, and is more for performance/size
benefits when they are not needed. In the index, it's impossible to
om
Hi Alexey,
Thank you for your suggestion! My understanding of Similarity, though,
is that this would affect the entire index, whereas I need something
that is field-configurable. Looking at Similarity.tf(), it seems to be
independent of the field (and unaware of it). I don't necessarily want
Just FYI - you can put Solr plugins in /lib as JAR files
rather than messing with solr.war
Erik
On Sep 16, 2009, at 10:15 AM, Alexey Serba wrote:
Hi Aaron,
You can override the default Lucene Similarity and disable the tf and
lengthNorm factors in the scoring formula (see
http://lucene.apache
Hi Aaron,
You can override the default Lucene Similarity and disable the tf and
lengthNorm factors in the scoring formula (see
http://lucene.apache.org/java/2_4_1/api/org/apache/lucene/search/Similarity.html
and http://lucene.apache.org/java/2_4_1/api/index.html).
You need to
1) compile the following clas
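
The class from the original message is cut off in this listing and is not reproduced here. As a rough illustration of what neutralizing tf and lengthNorm does to scoring, here is a standalone sketch that does not depend on Lucene. The Factors interface and both implementations are hypothetical stand-ins. In a real setup the class would extend Lucene's DefaultSimilarity and override its tf and lengthNorm methods.

```java
// Standalone sketch; all types here are hypothetical stand-ins, not Lucene classes.
final class FlatSimilaritySketch {

    // The two scoring factors discussed in the thread.
    interface Factors {
        float tf(float freq);
        float lengthNorm(String fieldName, int numTokens);
    }

    // Assumed default behaviour: sqrt(freq) and 1/sqrt(numTokens).
    static final class DefaultFactors implements Factors {
        public float tf(float freq) {
            return (float) Math.sqrt(freq);
        }
        public float lengthNorm(String fieldName, int numTokens) {
            return (float) (1.0 / Math.sqrt(numTokens));
        }
    }

    // Both factors neutralized: any match contributes 1, field length is ignored.
    static final class FlatFactors implements Factors {
        public float tf(float freq) {
            return freq > 0f ? 1f : 0f;
        }
        public float lengthNorm(String fieldName, int numTokens) {
            return 1f;
        }
    }

    // Product of the two factors for one term in one field.
    static float weight(Factors f, String field, float freq, int numTokens) {
        return f.tf(freq) * f.lengthNorm(field, numTokens);
    }

    public static void main(String[] args) {
        Factors[] all = {new DefaultFactors(), new FlatFactors()};
        for (Factors f : all) {
            // A short title with one match vs. a longer title repeating the term.
            System.out.println(f.getClass().getSimpleName()
                    + " short=" + weight(f, "title", 1f, 2)
                    + " redundant=" + weight(f, "title", 3f, 6));
        }
    }
}
```

Note that this sketch applies to every field alike, which is the limitation raised elsewhere in the thread. Length norms are also written at index time, so changing them generally requires reindexing.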