Joel,

To give you some context, we are running queries against 6 million documents in a SolrCloud environment. The grouping is done to de-duplicate content based on a unique field. Unfortunately, due to a requirement constraint, the only way for us to run the de-duplication is at query time.
The group numbers are pretty high in our case. The average number of distinct groups per query is around 1000, and the total number of distinct groups for the field is around 10k. Phrase queries are especially bad, averaging a response time of 10-12 seconds.

Having said that, CollapsingQParserPlugin makes a huge difference in performance. The only caveat is the lack of an equivalent to "group.facet". We discussed this earlier, and you confirmed it here:
http://lucene.472066.n3.nabble.com/RE-SOLR-6143-Bad-facet-counts-from-CollapsingQParserPlugin-td4140455.html#a4146645

Are there any plans to address this? I'm not sure if it's a big change at your end, but if there is something we can contribute to add it, I'm more than happy to help. I know there are a bunch of people who are looking forward to this.
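
For reference, here is roughly what the two request shapes being compared look like. This is only a minimal sketch: the field names `dedup_key` and `category` and the query text are placeholders, not our actual schema, and the code just builds the query strings without contacting a Solr instance.

```python
from urllib.parse import urlencode

# Placeholder names for illustration only (assumptions, not a real schema).
DEDUP_FIELD = "dedup_key"   # field used to de-duplicate documents
FACET_FIELD = "category"    # field we want facet counts for


def grouping_params(q, field=DEDUP_FIELD):
    """Query-time de-duplication through result grouping.

    group.facet=true asks for facet counts per distinct group
    rather than per matching document.
    """
    return {
        "q": q,
        "group": "true",
        "group.field": field,
        "group.ngroups": "true",
        "facet": "true",
        "facet.field": FACET_FIELD,
        "group.facet": "true",
    }


def collapse_params(q, field=DEDUP_FIELD):
    """Same de-duplication through the collapse filter query.

    There is no group.facet counterpart here: as far as I understand,
    facets are counted over the collapsed result set instead.
    """
    return {
        "q": q,
        "fq": "{!collapse field=%s}" % field,
        "facet": "true",
        "facet.field": FACET_FIELD,
    }


if __name__ == "__main__":
    phrase = '"solr cloud"'  # a phrase query, the slow case for grouping
    print("/select?" + urlencode(grouping_params(phrase)))
    print("/select?" + urlencode(collapse_params(phrase)))
```

The second form is the one that performs much better for us; the open question is only how to get group.facet-style counts with it.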