Re: [PR] Introduce growInRange to reduce array overallocation [lucene]

via GitHub Wed, 29 Nov 2023 13:33:36 -0800


stefanvodita commented on PR #12844:
URL: https://github.com/apache/lucene/pull/12844#issuecomment-1832732598


   Thanks for the suggestions @zhaih! I have to think about option 2 a bit more.
   
   If we change `ramBytesUsed` back, performance recovers (Mostly? I'm not sure 
how noisy this benchmark is).
   
   Benchmark config:
   ```
   dim = 100
   doc_vectors = '%s/data/enwiki-20120502-lines-1k-100d.vec' % 
constants.BASE_DIR
   query_vectors = '%s/util/tasks/vector-task-minilm.vec' % constants.BASE_DIR
   ```
   
   main:
   ```
   recall  latency nDoc    fanout  maxConn beamWidth       visited index ms
   0.720    0.16   10000   0       64      250     100     2551    1.00    
post-filter
   0.542    0.23   100000  0       64      250     100     43620   1.00    
post-filter
   0.512    0.27   200000  0       64      250     100     108235  1.00    
post-filter
   ```
   
   This PR:
   ```
   recall  latency nDoc    fanout  maxConn beamWidth       visited index ms
   0.720    0.16   10000   0       64      250     100     5028    1.00    
post-filter
   0.542    0.23   100000  0       64      250     100     315369  1.00    
post-filter
   0.512    0.28   200000  0       64      250     100     1578461 1.00    
post-filter
   ```
   
   This PR with the old memory estimation:
   ```
   recall       latency nDoc    fanout  maxConn beamWidth       visited index ms
   0.720         0.16   10000   0       64      250     100     2497    1.00    
post-filter
   0.542         0.23   100000  0       64      250     100     43886   1.00    
post-filter
   0.512         0.28   200000  0       64      250     100     117152  1.00    
post-filter
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Re: [PR] Introduce growInRange to reduce array overallocation [lucene]

Reply via email to