[GitHub] [lucene] msokolov commented on pull request #315: Disk write and read of hnsw graph

GitBox Fri, 24 Sep 2021 09:25:33 -0700


msokolov commented on pull request #315:
URL: https://github.com/apache/lucene/pull/315#issuecomment-926762372



   I did run a quick benchmark on an internal data-set (using KnnGraphTester) 
and saw some improvement to both latency and recall, and a reduction in nodes 
visited from this change:
   
   condition|recall|  latency| nDoc       |    fanout|  maxConn| beamWidth|   
visited| index ms
   
------------|-------|-----------|--------------|------------|---------------|---------------|------------|----------------
   main       |0.815|     0.91|   1000000|          50|              32|        
      64|       2590|    484529
   change   |0.836|    0.86 |   1000000 |         50|              32|          
    64|       2331|    462900
   
   and I think this test is actually unfair to HNSW because of the fanout which 
incurs some additional cost in the HNSW case for no benefit (it just increases 
top K by that amount and then throws away the extras).
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene] msokolov commented on pull request #315: Disk write and read of hnsw graph

Reply via email to