Hi Shamik,

You should be seeing the same result counts with grouping and the
CollapsingQParserPlugin, unless there are null values in the collapse
field. Let's see if we can figure out what the issue is.

Can you post the schema.xml field type definition for the ADSKDedup field?

Also can you post the full debug output for both queries?

Joel




Joel Bernstein
Search Engineer at Heliosearch


On Fri, Mar 14, 2014 at 6:22 PM, shamik <sham...@gmail.com> wrote:

> Hi,
>
>   I recently upgraded to 4.7, with the aim of replacing group queries with
> CollapsingQParserPlugin. As I'm comparing results between the two APIs,
> CollapsingQParserPlugin seems to be way off, in terms of relevancy and
> result count. Here's an example :
>
> *Group query*
>
> http://test-dev.mydomain.com/solr/adskhelpportal?fq=language:(english)&wt=xml&rows=40&start=0&fq=(ContentGroup-local
> :"Learn
> & Explore" OR ContentGroup-local:"Getting Started" OR
> ContentGroup-local:"Troubleshooting")&fq=Product:"PRD"&q=line&sort=score
>
> desc&group=true&group.field=ADSKDedup&group.ngroups=true&fl=title,ADSKDedup,score&debugQuery=true
>
> /Top 4 results/
> <lst name="grouped">
>       <lst name="ADSKDedup">
>          <int name="matches">14593</int>
>          <int name="ngroups">*13648*</int>
>          <arr name="groups">
>             <lst>
>                <str
> name="groupValue">fbfef4647e68c2300eba99028f2598a9</str>
>                <result name="doclist" numFound="1" start="0">
>                   <doc>
>                      <str
> name="ADSKDedup">fbfef4647e68c2300eba99028f2598a9</str>
>                      <arr name="title">
>                         <str>LINE</str>
>                      </arr>
>                      <float name="score">8.517085</float>
>                   </doc>
>                </result>
>             </lst>
>             <lst>
>                <str
> name="groupValue">GUID-E8C1190C-A26C-484C-ADDD-DDF81666F69F</str>
>                <result name="doclist" numFound="3" start="0">
>                   <doc>
>                      <arr name="title">
>                         <str>LINE (Command)</str>
>                      </arr>
>                      <str
> name="ADSKDedup">GUID-E8C1190C-A26C-484C-ADDD-DDF81666F69F</str>
>                   </doc>
>                </result>
>             </lst>
>             <lst>
>                <str
> name="groupValue">GUID-695722CD-A131-48DB-9AB8-162F0832FE04</str>
>                <result name="doclist" numFound="4" start="0">
>                   <doc>
>                      <str
> name="ADSKDedup">GUID-695722CD-A131-48DB-9AB8-162F0832FE04</str>
>                      <arr name="title">
>                         <str>About Controlling Extension Lines</str>
>                      </arr>
>                      <float name="score">5.1433907</float>
>                   </doc>
>                </result>
>             </lst>
>             <lst>
>                <str
> name="groupValue">GUID-9084DAC2-D5B7-4727-A443-205007A79440</str>
>                <result name="doclist" numFound="4" start="0">
>                   <doc>
>                      <arr name="title">
>                         <str>About Controlling Dimension Lines</str>
>                      </arr>
>                      <str
> name="ADSKDedup">GUID-9084DAC2-D5B7-4727-A443-205007A79440</str>
>                      <float name="score">5.1361656</float>
>                   </doc>
>                </result>
>             </lst>
>
>
> *CollapsingQParserPlugin query*
>
>
> http://test-dev.mydomain.com/solr/adskhelpportal?fq=language:(english)&wt=xml&rows=15&start=0&fq=(ContentGroup-local
> :"Learn
> & Explore" OR ContentGroup-local:"Getting Started" OR
>
> ContentGroup-local:"Troubleshooting")&fq=ProductLine:"PRD"&q=line&sort=score
> desc&fq={!collapse
> field=ADSKDedup}&fl=title,ADSKDedup,score&debugQuery=true
>
> /Top 4 results/
>  <result name="response" numFound="27142" start="0" maxScore="8.517085">
>       <doc>
>          <str name="ADSKDedup">fbfef4647e68c2300eba99028f2598a9</str>
>          <arr name="title">
>             <str>LINE</str>
>          </arr>
>          <float name="score">8.517085</float>
>       </doc>
>       <doc>
>          <str
> name="ADSKDedup">GUID-57CDDB6C-B12B-46CE-B9C5-22EFC17258FF</str>
>          <arr name="title">
>             <str>To Draw Lines</str>
>          </arr>
>          <float name="score">6.276938</float>
>       </doc>
>       <doc>
>          <arr name="title">
>             <str>Draw Lines</str>
>          </arr>
>          <str name="ADSKDedup">98b4a0e39400f0a216ff51a89922ce82</str>
>          <float name="score">6.224089</float>
>       </doc>
>       <doc>
>          <str name="ADSKDedup">4e51abdc0e8d30e77069505d93c1d4d4</str>
>          <arr name="title">
>             <str>Lines Tab</str>
>          </arr>
>          <float name="score">6.210026</float>
>       </doc>
>
> As you can see, the results are completely off, except for the first one.
> Moreover, the number of results returned are different as well. Group query
> has 13648 results which CollapsingQParserPlugin returns 27142, almost twice
> the size.
>
> I'm little baffled why the two APIs are returning different results for the
> same query. Are they fundamentally different ?
>
> Any pointers will be appreciated.
>
> -Thanks,
> Shamik
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/CollapsingQParserPlugin-returning-different-result-set-tp4123716.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>

Reply via email to