atris commented on issue #14341: URL: https://github.com/apache/lucene/issues/14341#issuecomment-2714751735
It's actually crazy - I was thinking of starting a discussion on this today. One thing that I have been playing with is creating clusters with centroids that are at a certain range distance from a given reference point, and then using a triangulation method during search to "estimate" the distance of faraway clusters and only traverse their members if any point seems to be a viable candidate for topN. This is especially useful for disconnected data sets, where some relevant points might be distributed far from each other and early termination tends to ignore them. Are you actively working on this? Or would you like me to explore more? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org