Hi, all. I know there is carrot2 and mahout for clustering. I want to implement
such thing: I fetch documents and want to group them into clusters when they
are added to index (i want to filter "similar" documents for example for 1
week). i need these documents quickly, so i cant rely on some postponed
calculations. Each document should have assigned cluster id (like group similar
documents into clusters and assign each document its cluster id. It's something
similar to news aggregators like google news. I dont need to search for
clusters with documents older than 1 week (for example). Each document will
have its unique id and saved into DB. But solr will have cluster id field also.
Is it possible to implement this with solr/carrot/mahout?