benwtrent commented on issue #12440: URL: https://github.com/apache/lucene/issues/12440#issuecomment-2672345401
Their LID technique almost feels like bootstrapping with `log(n)` clusters... I wonder if we could simply gather `log(n)` clusters, then at merge time merge like clusters with like clusters (potentially cutting down merge time), and then take the "centroids" and push them up to an upper layer. This would be a more uniform layering logic than random, and could take advantage of distributed segment work at merge time (e.g. each segment already has its known centroids, etc.).
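To make the idea above concrete, here is a rough sketch of what "merge like clusters with like clusters, then promote the centroids" could look like. None of this is Lucene code or the method from the referenced paper: `cluster_count`, `merge_like_clusters`, and `upper_layer` are hypothetical names, the base-2 logarithm is an assumption (the comment only says `log(n)`), and the greedy nearest-centroid pairing is just one simple way to decide which clusters are "alike".

```python
import math


def cluster_count(n):
    """Target number of clusters for n vectors.

    Assumption: log base 2, rounded up, with a minimum of 1.
    """
    return max(1, math.ceil(math.log2(n))) if n > 1 else 1


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def merge_like_clusters(seg_a, seg_b):
    """Fold each cluster of seg_b into its nearest cluster of seg_a.

    Each segment is a list of (centroid, size) pairs, i.e. the centroids
    the segment already knows. The merged centroid is the size-weighted
    mean, so no per-vector work is needed at merge time.
    """
    merged = [(list(c), n) for c, n in seg_a]
    if not merged:
        return [(list(c), n) for c, n in seg_b]
    for c_b, n_b in seg_b:
        # Greedy choice (an assumption): nearest existing centroid wins.
        i = min(range(len(merged)), key=lambda j: sq_dist(merged[j][0], c_b))
        c_a, n_a = merged[i]
        total = n_a + n_b
        merged[i] = (
            [(x * n_a + y * n_b) / total for x, y in zip(c_a, c_b)],
            total,
        )
    return merged


def upper_layer(merged):
    """Centroids to push up as the nodes of the next layer."""
    return [c for c, _ in merged]


seg_a = [([0.0, 0.0], 2), ([10.0, 10.0], 2)]
seg_b = [([1.0, 1.0], 2), ([9.0, 9.0], 2)]
merged = merge_like_clusters(seg_a, seg_b)
print(upper_layer(merged))  # [[0.5, 0.5], [9.5, 9.5]]
```

Note that this sketch keeps the cluster count of the first segment fixed; a real merge would presumably re-split or add clusters so the count tracks `log(n)` of the merged segment.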