Re: [PR] Speedup concurrent multi-segment HNSW graph search [lucene]

via GitHub Thu, 23 Nov 2023 09:04:52 -0800


mayya-sharipova commented on PR #12794:
URL: https://github.com/apache/lucene/pull/12794#issuecomment-1824736252


   @vigyasharma Answering other questions:
   
   > We seem to consistently see an improvement in recall between single
   > segment, and multi-segment runs (both seq and conc.) on baseline. Is this
   > because with multiple segments, we get multiple entry points into the
   > overall graph? Whereas in a single merged segment, we only have access to
   > a sparser set of nodes in layer-1 while finding the single best entry
   > point?
   
   Indeed, that observation is correct. With multiple segments, we retrieve
   `k` results from each segment and then merge the `k * num_of_segments`
   candidates to keep the best `k`. Because we examine more candidates
   overall, we also get better recall than with a single segment.
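
The merge step described above can be sketched as follows. This is only an illustration under stated assumptions, not the code from this PR: `SegmentTopKMerge`, `Hit`, and `mergeTopK` are hypothetical names, doc ids are assumed to be already global, and a higher score is assumed to mean a closer vector. Lucene has its own merge utilities (for example `TopDocs.merge`); the sketch deliberately avoids them so it runs with the JDK alone.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch of merging per-segment top-k hits; not Lucene's API. */
final class SegmentTopKMerge {

    /** One search hit: an assumed global doc id and a similarity score (higher is better). */
    record Hit(int docId, float score) {}

    /**
     * Pools the top-k lists of all segments (up to k * num_of_segments
     * candidates) and keeps the k best hits overall.
     */
    static List<Hit> mergeTopK(List<List<Hit>> perSegmentHits, int k) {
        List<Hit> all = new ArrayList<>();
        for (List<Hit> segmentHits : perSegmentHits) {
            all.addAll(segmentHits);
        }
        // Best score first; ties broken by the smaller doc id for determinism.
        all.sort((a, b) -> {
            int byScore = Float.compare(b.score(), a.score());
            return byScore != 0 ? byScore : Integer.compare(a.docId(), b.docId());
        });
        return new ArrayList<>(all.subList(0, Math.min(k, all.size())));
    }
}
```

With 3 segments and `k = 10`, for instance, up to 30 candidates are pooled before the final 10 are kept, whereas a single merged segment only ever collects 10.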
   
   
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


