Hi, We want to remove some results from the result set based on the result of some algorithms on some fields in adjacent documents. For example, if doc2 resembles or doc1 we want to remove it. We cannot do this in a search component because of problems with paging, maintaining rows=N results despite removal of some records etc. Instead i'd like to try to override the TopScoreDocCollector in SolrIndexSearcher and implement Collector.collect(int doc), however, the Javadoc states that it's not a good idea to use IndexSearcher or IndexReader to obtain the document and some fields.
Any hints to share to keep up performance in the collector? Any other ideas except implement as search component or use field collapsing? Thanks, Markus