How to obtain the Explained output programmatically?

2011-10-03 Thread David Ryan
Hi, I need to use some of the detailed information in the explain output of a Solr search. Here is one example: * http://localhost:8983/solr/select/?q=GB&version=2.2&start=0&rows=10&indent=on&debugQuery=true&fl=id,score * 0.18314168 = (MATCH) sum of: 0.18314168 = (MATCH) weight(text:gb in 1), p
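For reference, a minimal Python sketch of requesting the same debug output from a client. It reuses the host and parameters from the URL above and adds `wt=json`, which is not in the original request, so the response can be parsed. It also assumes the explain text sits under `debug` and then `explain`, keyed by document id. `build_debug_url` and `fetch_explain` are hypothetical helper names.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the example URL in the message above.
SOLR_SELECT = "http://localhost:8983/solr/select/"


def build_debug_url(query, rows=10):
    """Build a select URL that asks Solr for score explanations."""
    params = {
        "q": query,
        "start": 0,
        "rows": rows,
        "fl": "id,score",
        "debugQuery": "true",
        "wt": "json",  # assumption: JSON response writer, not in the original URL
    }
    return SOLR_SELECT + "?" + urlencode(params)


def fetch_explain(query, rows=10):
    """Return the assumed {doc_id: explanation} map from the debug section."""
    with urlopen(build_debug_url(query, rows)) as resp:
        body = json.load(resp)
    return body.get("debug", {}).get("explain", {})


url = build_debug_url("GB")
print(url)
```

Calling `fetch_explain("GB")` needs a running Solr instance at that address; `build_debug_url` alone does not touch the network.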

Re: How to obtain the Explained output programmatically?

2011-10-03 Thread David Ryan
Thanks Hoss! debug.explain.structured is definitely helpful. It adds some structure to the plain explain output. Is there a way to access this structured output in Java code (e.g., via a Solr plugin class)? We could wr
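On the client side (this is not a plugin-side Java answer), the structured form can be walked as a tree. A minimal Python sketch, assuming each node returned with `debug.explain.structured=true` is a map with `match`, `value`, `description` and an optional `details` list. The sample node is hand-written from the explain numbers quoted in the first message, with the description cut short, and is not captured from Solr. `flatten_explain` is a hypothetical helper.

```python
def flatten_explain(node, depth=0):
    """Yield (depth, value, description) for a node and all of its descendants."""
    yield depth, node["value"], node["description"]
    for child in node.get("details", []):
        yield from flatten_explain(child, depth + 1)


# Hand-written sample; the key names are an assumption about the response shape.
sample = {
    "match": True,
    "value": 0.18314168,
    "description": "sum of:",
    "details": [
        {
            "match": True,
            "value": 0.18314168,
            "description": "weight(text:gb in 1)",
        }
    ],
}

rows = list(flatten_explain(sample))
for depth, value, description in rows:
    # Indent by depth to mirror the plain-text explain layout.
    print("  " * depth + f"{value} = {description}")
```

Flattening to `(depth, value, description)` tuples keeps the individual factors addressable, which is the part that is awkward to recover from the plain-text explain string.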

Scoring of DisMax in Solr

2011-10-04 Thread David Ryan
Hi, When I examine the score calculation of DisMax in Solr, it looks to me that DisMax is using tf x idf^2 instead of tf x idf. Does anyone have insight into why tf x idf is not used here? Here is the score contribution from one field: score(q,c) = queryWeight x fieldWeight =

Re: Scoring of DisMax in Solr

2011-10-05 Thread David Ryan
Thanks! What's the procedure to report this if it's a bug? EDisMax has similar behavior. On Tue, Oct 4, 2011 at 11:24 PM, Bill Bell wrote: > This seems like a bug to me. > > On 10/4/11 6:52 PM, "David Ryan" wrote: > > >Hi, > > > > > >When

Re: Scoring of DisMax in Solr

2011-10-05 Thread David Ryan
Hi Markus, The idf calculation itself is correct. What I am trying to understand here is why the idf value is multiplied twice in the final score calculation. Essentially, tf x idf^2 is used instead of tf x idf. I'd like to understand the rationale behind that. On Wed, Oct 5, 2011 at 9:43 AM, Ma

Re: Scoring of DisMax in Solr

2011-10-05 Thread David Ryan
Ok, here is the calculation of the score: 0.18314168 = *2.3121865* * 0.15502669 * 1.4142135 * *2.3121865* * 0.15625. *2.3121865* is multiplied twice here. That is what I mean by saying tf x idf^2 is used instead of tf x idf. On Wed, Oct 5, 2011 at 10:42 AM, Markus Jelsma wrote: > Hi, > > I don't see
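The arithmetic above can be checked with a short Python sketch. Only the five numbers and the queryWeight x fieldWeight split come from the thread. The factor names (`query_norm`, `tf`, `field_norm`) are assumptions based on how Lucene's classic TF-IDF explain output is usually labeled, with 2.3121865 taken as the idf as stated in the message.

```python
import math

idf = 2.3121865          # appears twice in the product, per the message
query_norm = 0.15502669  # assumed label: queryNorm
tf = 1.4142135           # assumed label: tf
field_norm = 0.15625     # assumed label: fieldNorm

# queryWeight carries one idf factor ...
query_weight = idf * query_norm
# ... and fieldWeight carries the other.
field_weight = tf * idf * field_norm

score = query_weight * field_weight
print(score)

# The product agrees with the reported score up to rounding of the printed factors.
print(math.isclose(score, 0.18314168, abs_tol=1e-6))
```

Under this labeling, one idf factor comes from the query side and one from the field side, which is where the idf^2 in the final product comes from.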

New scoring models in LUCENE/SOLR (LUCENE-2959)

2011-10-05 Thread David Ryan
Hi, According to the JIRA issue 2959, https://issues.apache.org/jira/browse/LUCENE-2959 BM25 will be included in the next release of LUCENE. 1). Will BM25F be included in the next release as well as part of LUCENE-2959? 2). What's the timeline of the next release that new scoring modules will be

Re: Scoring of DisMax in Solr

2011-10-05 Thread David Ryan
The example does not include the evidence. But we do use eDisMax for scoring in Solr. The following is from solrconfig.xml: edismax Here is a short snippet of the explained result, where 0.1 is the Tie breaker in DisMax/eDisMax. 6.446447 = (MATCH) max plus 0.1 times others of: 0.63826215
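The "max plus 0.1 times others of" line describes how the per-field scores are combined. A minimal Python sketch of that combination, assuming the usual disjunction-max rule (best sub-score plus the tie breaker times the sum of the remaining ones). The sub-scores in the example are hypothetical, since the explain snippet above is cut off, and `dismax_score` is a hypothetical helper name.

```python
import math


def dismax_score(sub_scores, tie=0.1):
    """Combine per-field scores: max plus `tie` times the sum of the others."""
    if not sub_scores:
        return 0.0
    best = max(sub_scores)
    others = sum(sub_scores) - best
    return best + tie * others


# Hypothetical per-field scores, not taken from the thread.
example = dismax_score([3.0, 1.0, 2.0], tie=0.1)
print(example)
```

With `tie=0.0` only the best-matching field counts; with `tie=1.0` the result is the plain sum of all field scores.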

Re: New scoring models in LUCENE/SOLR (LUCENE-2959)

2011-10-05 Thread David Ryan
Do you mean both BM25 and BM25F? On Wed, Oct 5, 2011 at 11:44 AM, Robert Muir wrote: > On Wed, Oct 5, 2011 at 2:23 PM, David Ryan wrote: > > Hi, > > > > According to the JIRA issue 2959, > > https://issues.apache.org/jira/browse/LUCENE-2959 > > > > BM25