benwtrent opened a new issue, #14341: URL: https://github.com/apache/lucene/issues/14341
### Description

What do we think about clustering or grouping documents by centroids, or potentially in chunks of filters, and allowing multiple graphs per segment?

If segments are random sub-samples, it is conceivable that clusters share common distributions. Consequently, we could quit searching clusters early and only search a couple of the clusters at a time.

Seeing the optimistic query changes makes me think we could cluster, have one graph per cluster (or per chunk of the ordered docs), and more eagerly enable early termination of the multi-graph searches.

This is a crazy idea; maybe it isn't worth doing.
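To make the proposal concrete, here is a minimal sketch of the "one index per cluster, stop early" idea. It is not Lucene code and not a proposed implementation: `build_clusters` and `search` are hypothetical names, a brute-force scan stands in for the per-cluster HNSW graph, and the stopping rule (a triangle-inequality lower bound from each cluster's centroid and radius) is an assumption chosen because it is easy to verify. An approximate search would probably use a looser heuristic.

```python
import heapq
import math


def build_clusters(vectors, centroids):
    """Assign each vector to its nearest centroid and track each cluster's radius."""
    clusters = [{"centroid": c, "members": [], "radius": 0.0} for c in centroids]
    for doc_id, vec in enumerate(vectors):
        best = min(clusters, key=lambda cl: math.dist(vec, cl["centroid"]))
        best["members"].append((doc_id, vec))
        best["radius"] = max(best["radius"], math.dist(vec, best["centroid"]))
    return clusters


def search(clusters, query, k):
    """Return (sorted top-k [(distance, doc_id)], number of clusters searched)."""
    # By the triangle inequality, no member of a cluster can be closer to the
    # query than dist(query, centroid) - radius.
    bounds = sorted(
        (max(0.0, math.dist(query, cl["centroid"]) - cl["radius"]), i)
        for i, cl in enumerate(clusters)
    )
    heap = []  # max-heap via negated distances; holds the k best hits so far
    searched = 0
    for bound, i in bounds:
        # Clusters are visited in order of increasing lower bound, so once one
        # cannot beat the current k-th best hit, none of the remaining ones can.
        if len(heap) == k and bound >= -heap[0][0]:
            break
        searched += 1
        # Stand-in for searching this cluster's own graph.
        for doc_id, vec in clusters[i]["members"]:
            d = math.dist(query, vec)
            if len(heap) < k:
                heapq.heappush(heap, (-d, doc_id))
            elif d < -heap[0][0]:
                heapq.heapreplace(heap, (-d, doc_id))
    return sorted((-nd, doc_id) for nd, doc_id in heap), searched


vectors = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0),
           (10.0, 10.0), (11.0, 10.0), (10.0, 11.0)]
clusters = build_clusters(vectors, [(0.0, 0.0), (10.0, 10.0)])
hits, searched = search(clusters, (0.2, 0.1), k=2)
print([doc_id for _, doc_id in hits], searched)  # prints [0, 1] 1
```

In this toy data the query sits inside the first cluster, so the second cluster is never searched. With `k=4` the first cluster cannot fill the result set, and both clusters are searched.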