Hi, I am currently using solr 3.6.1 version and for indexing data, i am using the data import handler for 3.5 because of the reason posted in the following forum link http://lucene.472066.n3.nabble.com/Dataimport-Handler-in-solr-3-6-1-td4001149.html
I am trying to achieve result grouping based on a field "grpValue" which has value like this "Name XYZ|Company". There are totally 359 docs that were indexed and the field "grpValue" in all the 359 docs contains the word "Company" in its value. I gave the following in my schema.xml for splitting the word while indexing and querying <fieldType name="groupField" class="solr.TextField" positionIncrementGap="100"> <analyzer type="index"> <tokenizer class="solr.PatternTokenizerFactory" pattern="\s+|\|"/> <filter class="solr.LowerCaseFilterFactory"/> <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords_new.txt" enablePositionIncrements="true" /> </analyzer> <analyzer type="query"> <tokenizer class="solr.PatternTokenizerFactory" pattern="\s+|\|"/> <filter class="solr.LowerCaseFilterFactory"/> <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords_new.txt" enablePositionIncrements="true" /> </analyzer> </fieldType> I am trying to split the words if I have a single space or an “|” symbol in my data when i use the pattern="\s+|\|" in PatternTokenizerFactory. When I gave the analyze option in solr, the sample value was split inot 3 words "Name","XYZ","Company" in both my index and query analyzer. When i gave the following url http://localhost:8080/solr/core1/select/?q=*%3A*&version=2.2&start=0&rows=359&indent=on&group=true&group.field=grpValue&group.limit=0 I noticed that I have a grouping name called Company which has numFound as 73 but the particular field "grpValue" has the word "Company" in its value in all the 359 docs. Ideally, i should have got 359 docs as numFound under my group - <lst name="grouped"> - <lst name="grpValue"> <int name="matches">359</int> - <arr name="groups"> - <lst> <str name="groupValue">Company</str> <result name="doclist" numFound="73" start="0" /> </lst> Please someone guide me as to why only 73 docs is present in that group instead of 359. I also noticed that when I counted the numFound in all the groups, it totalled upto 359. Please guide me on this and I am not sure what I am missing. Please let me know in case more details is needed. Thanks in advance. -- View this message in context: http://lucene.472066.n3.nabble.com/Doubts-in-Result-Grouping-in-solr-3-6-1-tp4005239.html Sent from the Solr - User mailing list archive at Nabble.com.