Thanks for your recommendation Toke. Will try to ask in the carrot forum.
Regards, Edwin On 26 August 2015 at 18:45, Toke Eskildsen <t...@statsbiblioteket.dk> wrote: > On Wed, 2015-08-26 at 15:47 +0800, Zheng Lin Edwin Yeo wrote: > > > Now I've tried to increase the carrot.fragSize to 75 and > > carrot.summarySnippets to 2, and set the carrot.produceSummary to > > true. With this setting, I'm mostly able to get the cluster results > > back within 2 to 3 seconds when I set rows=200. I'm still trying out > > to see if the cluster labels are ok, but in theory do you think this > > is a suitable setting to attempt to improve the clustering results and > > at the same time improve the performance? > > I don't know - the quality/performance point as well as which knobs to > tweak is extremely dependent on your corpus and your hardware. A person > with better understanding of carrot might be able to do better sanity > checking, but I am not at all at that level. > > Related, it seems to me that the question of how to tweak the clustering > has little to do with Solr and a lot to do with carrot (assuming here > that carrot is the bottleneck). You might have more success asking in a > carrot forum? > > > - Toke Eskildsen, State and University Library, Denmark > > > >