Hello,
I would like to implement a cluster-based retrieval model in Lucene. I haven't gone
through the code fully and am not aware of any existing Lucene-based
implementations of it.
Could someone point me to where to begin? There is too much
code to go through and I have very little time.

My idea is as follows.

The Lucene index should be organized into clusters, i.e. at indexing time
each Document (D) is assigned to a cluster.
When a Query (Q) is submitted, each cluster is searched for relevant documents,
and the documents from that cluster as well as from the other clusters are ranked.

A brute-force way of implementing it could be:
1. Clusters are denoted by a field named cluster (C).
2. The words are searched in the cluster field.
3. The scoring functions are changed to incorporate the math used in
cluster-based retrieval.
4. Documents in each cluster are ranked separately and then merged.
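
To make steps 3 and 4 concrete, here is a minimal sketch in plain Java. It does not use any Lucene API. The Hit class, the use of the mean hit score as the cluster score, and the lambda interpolation weight are all assumptions made only for illustration; the actual cluster retrieval math would replace them.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch, not Lucene API: re-ranks raw hit scores using cluster evidence.
class ClusterRanking {

    /** One search hit: document id, value of its "cluster" field, and a query score. */
    static final class Hit {
        final String docId;
        final String cluster;
        final double score;

        Hit(String docId, String cluster, double score) {
            this.docId = docId;
            this.cluster = cluster;
            this.score = score;
        }
    }

    /** Assumed cluster-level score: the mean raw score of the hits in each cluster. */
    static Map<String, Double> clusterScores(List<Hit> hits) {
        Map<String, Double> sum = new HashMap<>();
        Map<String, Integer> count = new HashMap<>();
        for (Hit h : hits) {
            sum.merge(h.cluster, h.score, Double::sum);
            count.merge(h.cluster, 1, Integer::sum);
        }
        Map<String, Double> mean = new HashMap<>();
        for (Map.Entry<String, Double> e : sum.entrySet()) {
            mean.put(e.getKey(), e.getValue() / count.get(e.getKey()));
        }
        return mean;
    }

    /**
     * Step 3: combine each document score with its cluster score (assumed linear
     * interpolation with weight lambda). Step 4: merge all clusters into one list
     * sorted by the combined score, highest first.
     */
    static List<Hit> rerank(List<Hit> hits, double lambda) {
        Map<String, Double> cluster = clusterScores(hits);
        List<Hit> merged = new ArrayList<>();
        for (Hit h : hits) {
            double combined = lambda * h.score + (1.0 - lambda) * cluster.get(h.cluster);
            merged.add(new Hit(h.docId, h.cluster, combined));
        }
        merged.sort(Comparator.comparingDouble((Hit h) -> h.score).reversed());
        return merged;
    }

    public static void main(String[] args) {
        List<Hit> hits = Arrays.asList(
                new Hit("d1", "A", 1.0),
                new Hit("d2", "A", 5.0),
                new Hit("d3", "B", 1.8));
        for (Hit h : rerank(hits, 0.5)) {
            System.out.println(h.docId + " (" + h.cluster + ") " + h.score);
        }
    }
}
```

In this sketch the raw scores would come from a single query over all documents, with the cluster read back from the stored cluster field, so the merge happens in memory rather than through one query per cluster.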

The problem with this approach is that many queries will have to be created
and the result processing will increase considerably.
If there are other ways of doing it, please let me know.

Regards
Mansi

-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Cluster-Retrieval-in-Lucene-tp1968500p1968500.html
Sent from the Lucene - General mailing list archive at Nabble.com.
