Thanks Grant and Stanislaw. To answer your question in terms of minimum term is, I am working with "joke text" very short in length so the clusters are not so meaning full.. I mean lot of adverbs and nouns, I thought increasing it might give me less cluster but bit more meaningful (maybe not).
--- Den ons 2009-04-22 skrev Grant Ingersoll <gsing...@apache.org>: > Från: Grant Ingersoll <gsing...@apache.org> > Ämne: Re: SOLR-769 clustering > Till: solr-user@lucene.apache.org > Datum: onsdag 22 april 2009 14.44 > > On Apr 21, 2009, at 3:46 AM, Antonio Eggberg wrote: > > > > > Hello: > > > > I have got the clustering working i.e SOLR-769. I am > wondering > > > > - why there is a filed called "body", does it have > special purpose? > > > > <field name="body" type="text" > indexed="true" stored="true" multiValued="true"/> > > > > That's just used in the test schema and there isn't any > need for you to use it. > > > > - can my clustering field be a copyField? basically I > like to remove the urls and html? > > As long as it is stored, a copyField should be fine. > > > > > > > - is there anyway to have minimum number of labels per > cluster? > > See Stanislaw's answer. > > -------------------------- > Grant Ingersoll > http://www.lucidimagination.com/ > > Search the Lucene ecosystem > (Lucene/Solr/Nutch/Mahout/Tika/Droids) using Solr/Lucene: > http://www.lucidimagination.com/search > > __________________________________________________________ Går det långsamt? Skaffa dig en snabbare bredbandsuppkoppling. Sök och jämför priser hos Kelkoo. http://www.kelkoo.se/c-100015813-bredband.html?partnerId=96914325