Greetings,

We've been experimenting with grouping on a field of the documents returned by a Lucene search, and the results so far are not encouraging. The more results a query returns, the longer the grouping takes, up to tens of seconds, probably because we're doing expensive disk seeks. I'm hoping the Solr crew out there can offer some insight :)
What we're trying to do is similar to SQL's "GROUP BY". Say our documents are indexed by keyword on the content body and also by author name. If I search our (very large) document store for the word "laptop", I would like to find the 10 authors who appear most often among the matching documents. I've searched through the mailing list archives but couldn't glean much insight.

What do you think?

--
http://www.roadtofailure.com
The Fringes of Scalability, Social Media, and Computer Science
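
P.S. To make the goal concrete, here is a rough sketch in plain Java of the computation we're after. It is not Lucene or Solr API code: the class name, the `authorByDoc` array (the author of each document, assumed to be loaded into memory once so that counting needs no per-hit disk read) and the `matchingDocIds` array (the ids of the hits for a query such as "laptop") are all hypothetical names made up for illustration.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class TopAuthors {

    /**
     * Counts how often each author occurs among the matching documents and
     * returns the n most frequent authors, highest count first.
     *
     * authorByDoc[docId] is assumed to hold the author of document docId,
     * already in memory (a hypothetical per-document lookup table).
     */
    public static List<Map.Entry<String, Integer>> topAuthors(
            String[] authorByDoc, int[] matchingDocIds, int n) {
        // Tally one count per matching document, keyed by author.
        Map<String, Integer> counts = new HashMap<>();
        for (int docId : matchingDocIds) {
            counts.merge(authorByDoc[docId], 1, Integer::sum);
        }

        // Sort by count descending; break ties by author name so the
        // result is deterministic.
        List<Map.Entry<String, Integer>> entries = new ArrayList<>(counts.entrySet());
        entries.sort((a, b) -> {
            int byCount = Integer.compare(b.getValue(), a.getValue());
            return byCount != 0 ? byCount : a.getKey().compareTo(b.getKey());
        });

        // Keep only the first n entries (or fewer if there are fewer authors).
        return new ArrayList<>(entries.subList(0, Math.min(n, entries.size())));
    }
}
```

Calling `TopAuthors.topAuthors(authorByDoc, matchingDocIds, 10)` would give the "top 10 authors" from the example above. Sorting every distinct author is the simplest thing that works for a sketch; with a very large number of authors a bounded priority queue of size n would presumably be cheaper.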