yes there is a solr component if you want to cluster solr documents , check
the following link
http://wiki.apache.org/solr/ClusteringComponent

Carrot2 might be good if you want to cluster few thousands of documents ,
for example when user search solr , just cluster the  search results

Mahout is much more scalable and probably you need Hadoop for that


thanks
chandan

On Tue, Sep 4, 2012 at 2:10 PM, Denis Kuzmenok <forward...@ukr.net> wrote:

>
>
> -------- Original Message --------
> Subject: Solr Clustering
> From: Denis Kuzmenok <forward...@ukr.net>
> To: solr-user@lucene.apache.org
> CC:
>
> Hi, all.
> I know there is carrot2 and mahout for clustering. I want to implement
> such thing:
> I fetch documents and want to group them into clusters when they are added
> to index (i want to filter "similar" documents for example for 1 week). i
> need these documents quickly, so i cant rely on some postponed
> calculations. Each document should have assigned cluster id (like group
> similar documents into clusters and assign each document its cluster id.
> It's something similar to news aggregators like google news. I dont need
> to search for clusters with documents older than 1 week (for example). Each
> document will have its unique id and saved into DB. But solr will have
> cluster id field also.
> Is it possible to implement this with solr/carrot/mahout?




-- 
Chandan Tamrakar
*
*

Reply via email to