Re: Scoring of DisMax in Solr

2011-10-05 Thread Bill Bell
Markus, the calculation is correct. Look at your output: Result = queryWeight(text:gb) * fieldWeight(text:gb in 1), that is, Result = (idf(docFreq=6, numDocs=26) * queryNorm) * (tf(termFreq(text:gb)=2) * idf(docFreq=6, numDocs=26) * fieldNorm(field=text, doc=1)). You should notice that idf(docFreq=6

Re: Scoring of DisMax in Solr

2011-10-05 Thread David Ryan
The example does not include the evidence, but we do use eDisMax for scoring in Solr. The following is from solrconfig.xml: edismax. Here is a short snippet of the explained result, where 0.1 is the tie breaker in DisMax/eDisMax. 6.446447 = (MATCH) max plus 0.1 times others of: 0.63826215
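The "max plus 0.1 times others of" line in the explain output describes how a disjunction-max query combines per-field scores: the best sub-score, plus the tie breaker times the sum of the remaining ones. A minimal sketch follows. The function name `dismax_score` and the sub-scores in the usage line are hypothetical, since the snippet above is cut off before the full list of sub-scores.

```python
def dismax_score(sub_scores, tie):
    """Combine per-field scores the way 'max plus <tie> times others' reads."""
    best = max(sub_scores)
    # Every sub-score except the best one is scaled by the tie breaker.
    others = sum(sub_scores) - best
    return best + tie * others


# Hypothetical sub-scores, NOT taken from the thread's explain output.
print(dismax_score([0.5, 0.2, 0.1], tie=0.1))
```

With tie = 0 only the best field counts; with tie = 1 the result is the plain sum of all fields.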

Re: Scoring of DisMax in Solr

2011-10-05 Thread Chris Hostetter
: Thanks! What's the procedure to report this if it's a bug? : EDisMax has similar behavior. what you are seeing isn't specific to dismax & edismax (in fact: there's no evidence in your example that dismax is even being used) what you are seeing is the basic scoring of a TermQuery using the D

Re: Scoring of DisMax in Solr

2011-10-05 Thread David Ryan
Ok, here is the calculation of the score: 0.18314168 = 2.3121865 * 0.15502669 * 1.4142135 * 2.3121865 * 0.15625. The idf value 2.3121865 is multiplied in twice here. That is what I mean: tf x idf^2 is used instead of tf x idf. On Wed, Oct 5, 2011 at 10:42 AM, Markus Jelsma wrote: > Hi, > > I don't see
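The five factors above can be regrouped as queryWeight * fieldWeight, which shows where each copy of idf comes from. This is only a sketch: the function names are made up for illustration, and the mapping of 0.15502669 to queryNorm and 0.15625 to fieldNorm is inferred from the order of the factors, not stated in the message. The tf value 1.4142135 is sqrt(2), matching termFreq=2 under Lucene's default similarity.

```python
def query_weight(idf, query_norm):
    # Query side: idf appears once here.
    return idf * query_norm


def field_weight(tf, idf, field_norm):
    # Document side: idf appears a second time here.
    return tf * idf * field_norm


def term_score(tf, idf, query_norm, field_norm):
    return query_weight(idf, query_norm) * field_weight(tf, idf, field_norm)


score = term_score(
    tf=1.4142135,           # sqrt(termFreq=2)
    idf=2.3121865,          # idf(docFreq=6, numDocs=26)
    query_norm=0.15502669,  # assumed to be queryNorm
    field_norm=0.15625,     # assumed to be fieldNorm
)
print(score)
```

The product agrees with the reported 0.18314168 to within rounding, so the explain output is internally consistent: idf enters once through the query weight and once through the field weight.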

Re: Scoring of DisMax in Solr

2011-10-05 Thread Markus Jelsma
Hi, I don't see 2.3121865 * 2 anywhere in your debug output or something that looks like that. > Hi Markus, > > The idf calculation itself is correct. > What I am trying to understand here is why idf value is multiplied twice > in the final score calculation. Essentially, tf x idf^2 is used

Re: Scoring of DisMax in Solr

2011-10-05 Thread David Ryan
Hi Markus, The idf calculation itself is correct. What I am trying to understand here is why the idf value is multiplied twice in the final score calculation. Essentially, tf x idf^2 is used instead of tf x idf. I'd like to understand the rationale behind that. On Wed, Oct 5, 2011 at 9:43 AM, Ma

Re: Scoring of DisMax in Solr

2011-10-05 Thread Markus Jelsma
In Lucene's default similarity, idf = 1 + ln(numDocs / (docFreq + 1)). With numDocs=26 and docFreq=6: 1 + ln(26 / 7) =~ 2.3121865. I don't see a problem. > Hi, > > > When I examine the score calculation of DisMax in Solr, it looks to me > that DisMax is using tf x idf^2 instead of tf x idf. > Does anyone have insight why tf x id
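The idf figure can be checked with a few lines of Python. This is a sketch of the formula as quoted in the message, not a copy of Lucene's code; Lucene computes in single-precision floats, so the last digit may differ slightly.

```python
import math


def idf(doc_freq, num_docs):
    """idf = 1 + ln(numDocs / (docFreq + 1)), as quoted in the thread."""
    return 1.0 + math.log(num_docs / (doc_freq + 1))


# Values from the explain output: docFreq=6, numDocs=26.
print(idf(6, 26))
```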

Re: Scoring of DisMax in Solr

2011-10-05 Thread David Ryan
Thanks! What's the procedure to report this if it's a bug? EDisMax has similar behavior. On Tue, Oct 4, 2011 at 11:24 PM, Bill Bell wrote: > This seems like a bug to me. > > On 10/4/11 6:52 PM, "David Ryan" wrote: > > >Hi, > > > > > >When I examine the score calculation of DisMax in Solr, it

Re: Scoring of DisMax in Solr

2011-10-04 Thread Bill Bell
This seems like a bug to me. On 10/4/11 6:52 PM, "David Ryan" wrote: >Hi, > > >When I examine the score calculation of DisMax in Solr, it looks to me >that DisMax is using tf x idf^2 instead of tf x idf. >Does anyone have insight why tf x idf is not used here? > >Here is the score contributio

Scoring of DisMax in Solr

2011-10-04 Thread David Ryan
Hi, When I examine the score calculation of DisMax in Solr, it looks to me that DisMax is using tf x idf^2 instead of tf x idf. Does anyone have insight why tf x idf is not used here? Here is the score contribution from one field: score(q,c) = queryWeight x fieldWeight =