Re: questions about Clustering

2009-05-23 Thread Stanislaw Osinski
> > Hmm, I saw the comment in ClusteringDocumentList.java of Carrot2: > > /* > * If you know what query generated the documents you're about to cluster, > pass > * the query to the algorithm, which will usually increase clustering > quality. > */ > attributes.put(AttributeNames.QUERY, "data mining"

Re: questions about Clustering

2009-05-23 Thread Stanislaw Osinski
> > 1. if q=*:* is requested, Carrot2 will receive "MatchAllDocsQuery" >> via attributes. Is it OK? >> > > Yes, it only clusters on the Doc List, not the Doc Set (in other words, > it's your rows that matter) Just to add to that: Carrot2 should be able to cluster up to ~1000 search results, but b

Re: questions about Clustering

2009-05-23 Thread Koji Sekiguchi
Grant Ingersoll wrote: On May 22, 2009, at 11:41 PM, Koji Sekiguchi wrote: I'm thinking using clustering (SOLR-769) function for my project. I have a couple of questions: 1. if q=*:* is requested, Carrot2 will receive "MatchAllDocsQuery" via attributes. Is it OK? Yes, it only clusters on t

Re: questions about Clustering

2009-05-23 Thread Grant Ingersoll
On May 22, 2009, at 11:41 PM, Koji Sekiguchi wrote: I'm thinking using clustering (SOLR-769) function for my project. I have a couple of questions: 1. if q=*:* is requested, Carrot2 will receive "MatchAllDocsQuery" via attributes. Is it OK? Yes, it only clusters on the Doc List, not the Doc

questions about Clustering

2009-05-22 Thread Koji Sekiguchi
I'm thinking using clustering (SOLR-769) function for my project. I have a couple of questions: 1. if q=*:* is requested, Carrot2 will receive "MatchAllDocsQuery" via attributes. Is it OK? 2. I'd like to use it on an environment other than English, e.g. Japanese. I've implemented Carrot2Japanese