Hi, I am attempting to cluster a query. It kinda works, but where my > (regular) query returns 500 results the cluster only shows 1-10 hits for > each cluster (5 clusters). Never more than 10 docs and I know its not > right. What could be happening here? It should be showing dozens of > documents per cluster. >
Just to clarify -- how many documents do you see in the response (<result name="response" /> section)? Clustering is performed on the search results (in real time), so if you request 10 results, clustering will apply only to those 10 results. To get a larger number of clusters you'd need to request more results, e.g. 50, 100, 200 etc. Obviously, the trade-off here is that it will take longer to fetch the documents from the index, clustering time will also increase. For some guidance on choosing the clustering algorithm, you can take a look at the following section of Carrot2 manual: http://download.carrot2.org/stable/manual/#section.advanced-topics.fine-tuning.choosing-algorithm . Cheers, Staszek