javanna commented on code in PR #13542: URL: https://github.com/apache/lucene/pull/13542#discussion_r1731275591
########## lucene/core/src/java/org/apache/lucene/search/TotalHitCountCollectorManager.java: ########## @@ -28,17 +31,77 @@ */ public class TotalHitCountCollectorManager implements CollectorManager<TotalHitCountCollector, Integer> { + + /** + * Internal state shared across the different collectors that this collector manager creates. It + * tracks leaves seen as an argument of {@link Collector#getLeafCollector(LeafReaderContext)} + * calls, to ensure correctness: if the first partition of a segment early terminates, count has + * been already retrieved for the entire segment hence subsequent partitions of the same segment + * should also early terminate. If the first partition of a segment computes hit counts, + * subsequent partitions of the same segment should do the same, to prevent their counts from + * being retrieve from {@link LRUQueryCache} (which returns counts for the entire segment) + */ + private final Map<LeafReaderContext, Boolean> seenContexts = new HashMap<>(); + @Override public TotalHitCountCollector newCollector() throws IOException { - return new TotalHitCountCollector(); + return new LeafPartitionAwareTotalHitCountCollector(seenContexts); } @Override public Integer reduce(Collection<TotalHitCountCollector> collectors) throws IOException { + // TODO this makes the collector manager instance reusable across multiple searches. It isn't a + // strict requirement Review Comment: ehm, good point, I was hoping that gradlew tidy does what it has to do. I think it was me who went on a new line randomly when typing, not my IDE :) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org