Hi,

I am currently using solr 3.6.1 version and for indexing data, i am using
the data import handler for 3.5 because of the reason posted in the
following forum link
http://lucene.472066.n3.nabble.com/Dataimport-Handler-in-solr-3-6-1-td4001149.html

I am trying to achieve result grouping based on a field "grpValue" which has
value like this "Name XYZ|Company". There are totally 359 docs that were
indexed and the field "grpValue" in all the 359 docs contains the word
"Company" in its value.

I gave the following in my schema.xml for splitting the word while indexing
and querying

<fieldType name="groupField" class="solr.TextField"
positionIncrementGap="100">
 <analyzer type="index">  
<tokenizer class="solr.PatternTokenizerFactory" pattern="\s+|\|"/>

      <filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.StopFilterFactory" ignoreCase="true"
words="stopwords_new.txt" enablePositionIncrements="true" />
 </analyzer>
 <analyzer type="query">

 <tokenizer class="solr.PatternTokenizerFactory" pattern="\s+|\|"/>
      <filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.StopFilterFactory" ignoreCase="true"
words="stopwords_new.txt" enablePositionIncrements="true" />
 </analyzer>
</fieldType>


I am trying to split the words if I have a single space or an “|” symbol in
my data when i use the pattern="\s+|\|" in PatternTokenizerFactory. 

When I gave the analyze option in solr, the sample value was split inot 3
words "Name","XYZ","Company" in both my index and query analyzer.

When i gave the following url

http://localhost:8080/solr/core1/select/?q=*%3A*&version=2.2&start=0&rows=359&indent=on&group=true&group.field=grpValue&group.limit=0

I noticed that I have a grouping name called Company which has numFound as
73 but the particular field "grpValue" has the word "Company" in its value
in all the 359 docs. Ideally, i should have got 359 docs as numFound under
my group

- <lst name="grouped">
- <lst name="grpValue">
  <int name="matches">359</int> 
- <arr name="groups">
- <lst>
  <str name="groupValue">Company</str> 
  <result name="doclist" numFound="73" start="0" /> 
  </lst>

Please someone guide me as to why only 73 docs is present in that group
instead of 359.

I also noticed that when I counted the numFound in all the groups, it
totalled upto 359. 


Please guide me on this and I am not sure what I am missing. Please let me
know in case more details is needed.

Thanks in advance.  






--
View this message in context: 
http://lucene.472066.n3.nabble.com/Doubts-in-Result-Grouping-in-solr-3-6-1-tp4005239.html
Sent from the Solr - User mailing list archive at Nabble.com.

Reply via email to