benwtrent commented on PR #12551: URL: https://github.com/apache/lucene/pull/12551#issuecomment-1720048714
@jimczi I like this idea at first glance, but I have one major concern: what about data that is indexed in a specific order? Two tests to verify how this behaves would be:

- Index the cohere data sorted by magnitude
- Cluster the data and index it sorted by assigned cluster

While data should typically be randomly distributed, any option we provide here should be escapable, or at least be able to handle those situations (potentially by just increasing efSearch as a whole).
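As a rough illustration of the two orderings suggested in this comment, here is a minimal sketch in plain Python. It only prepares the input order; it does not touch Lucene or the actual cohere dataset. The helper names (`sort_by_magnitude`, `assign_clusters`, `sort_by_cluster`) are hypothetical, and the tiny k-means is an assumed stand-in for whatever clustering one would really use.

```python
import math
import random

def sort_by_magnitude(vectors):
    """Return the vectors ordered by ascending L2 norm."""
    return sorted(vectors, key=lambda v: math.sqrt(sum(x * x for x in v)))

def assign_clusters(vectors, k, iters=10, seed=0):
    """Tiny Lloyd's k-means (illustrative only); returns one cluster id per vector."""
    rng = random.Random(seed)
    centroids = [list(v) for v in rng.sample(vectors, k)]
    labels = [0] * len(vectors)
    for _ in range(iters):
        # Assignment step: nearest centroid by squared Euclidean distance.
        labels = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(v, centroids[c])))
            for v in vectors
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels

def sort_by_cluster(vectors, labels):
    """Return the vectors reordered so that each cluster is contiguous."""
    order = sorted(range(len(vectors)), key=lambda i: labels[i])
    return [vectors[i] for i in order]
```

Feeding either reordered list to the indexer in sequence would give each segment a skewed slice of the data, which is the adversarial case the two tests are meant to exercise.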