Re: [I] Make HNSW merges faster [lucene]

via GitHub Thu, 20 Feb 2025 10:46:09 -0800


benwtrent commented on issue #12440:
URL: https://github.com/apache/lucene/issues/12440#issuecomment-2672345401


   Their LID technique almost feels like boot strapping with `log(n)` 
clusters...
   
   I wonder if we could simply gather `log(n)` clusters, then at merge time we 
merge like clusters with like clusters (potentially cutting down merging time), 
and then take the "centroids" and pushing them up to an upper layer. 
   
   This would be a more uniform layering logic than random, and could take 
advantage of distributed segment work at merge time (e.g. each segment already 
has their known centroids, etc.)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Re: [I] Make HNSW merges faster [lucene]

Reply via email to