Hi,
I am currently using solr 3.6.1 version and for indexing data, i am using
the data import handler for 3.5 because of the reason posted in the
following forum link
http://lucene.472066.n3.nabble.com/Dataimport-Handler-in-solr-3-6-1-td4001149.html
I am trying to achieve result grouping based on a field "grpValue" which has
value like this "Name XYZ|Company". There are totally 359 docs that were
indexed and the field "grpValue" in all the 359 docs contains the word
"Company" in its value.
I gave the following in my schema.xml for splitting the word while indexing
and querying
<fieldType name="groupField" class="solr.TextField"
positionIncrementGap="100">
<analyzer type="index">
<tokenizer class="solr.PatternTokenizerFactory" pattern="\s+|\|"/>
<filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.StopFilterFactory" ignoreCase="true"
words="stopwords_new.txt" enablePositionIncrements="true" />
</analyzer>
<analyzer type="query">
<tokenizer class="solr.PatternTokenizerFactory" pattern="\s+|\|"/>
<filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.StopFilterFactory" ignoreCase="true"
words="stopwords_new.txt" enablePositionIncrements="true" />
</analyzer>
</fieldType>
I am trying to split the words if I have a single space or an “|” symbol in
my data when i use the pattern="\s+|\|" in PatternTokenizerFactory.
When I gave the analyze option in solr, the sample value was split inot 3
words "Name","XYZ","Company" in both my index and query analyzer.
When i gave the following url
http://localhost:8080/solr/core1/select/?q=*%3A*&version=2.2&start=0&rows=359&indent=on&group=true&group.field=grpValue&group.limit=0
I noticed that I have a grouping name called Company which has numFound as
73 but the particular field "grpValue" has the word "Company" in its value
in all the 359 docs. Ideally, i should have got 359 docs as numFound under
my group
- <lst name="grouped">
- <lst name="grpValue">
<int name="matches">359</int>
- <arr name="groups">
- <lst>
<str name="groupValue">Company</str>
<result name="doclist" numFound="73" start="0" />
</lst>
Please someone guide me as to why only 73 docs is present in that group
instead of 359.
I also noticed that when I counted the numFound in all the groups, it
totalled upto 359.
Please guide me on this and I am not sure what I am missing. Please let me
know in case more details is needed.
Thanks in advance.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Doubts-in-Result-Grouping-in-solr-3-6-1-tp4005239.html
Sent from the Solr - User mailing list archive at Nabble.com.