Thanks Sujit, Mikhail for you suggestions Sujit - Continuing to do it at client side increases one extra cycle between server and the client. Moreover it does not remain centralized, so I may have to repeat client side logic to multiple places, depending upon how it is implemented.
Mikhail - More Like This (mlt) is different than what I require. I am guessing the best matching categories for a "set of documents" and then filtering through only those categories, while mlt finds similar documents for "individual" documents. There are two things which will not work for me - 1. Mlt suggests similar documents for individual documents, while I am working on aggregated result set. 2. Mlt is based on the same scoring mechanism which was not providing relevant results and that's why I moved to this 2 query system I was wondering that many people might have thought about this but did not find if anybody worked on this. Is it a bad idea? Or something which has repercussions (other than slightly increased response time)? Thanks again On Wed, May 16, 2012 at 11:58 AM, Mikhail Khludnev < mkhlud...@griddynamics.com> wrote: > Hello, > > have you checked MoreLikeThis feature? > > On Tue, May 15, 2012 at 11:26 PM, Samarendra Pratap <samarz...@gmail.com > >wrote: > > > - We are calculating frequency of category ids in these top results. We > > are not using facets because that gives count for all, relevant or > > irrelevant, results. > > - Based on category frequencies within top matching results we are > > trying to find a few most frequent categories by simple calculation. > Now > > we > > are very confident that these categories are the ones which best suit > to > > our query. > > > > > > -- > Sincerely yours > Mikhail Khludnev > Tech Lead > Grid Dynamics > > <http://www.griddynamics.com> > <mkhlud...@griddynamics.com> > -- Regards, Samar