benwtrent commented on PR #14304:
URL: https://github.com/apache/lucene/pull/14304#issuecomment-2697969649

   I compared this branch with main. There are measurable improvements, but the 
quantization step isn't the main bottle neck. Vector comparisons still dominate 
the costs. But, its a nice bump I would say.
   
   candidate:
   ```
   recall  latency (ms)    nDoc  topK  fanout  maxConn  beamWidth  quantized  
index s  index docs/s  force merge s  num segments  index size (MB)  vec disk 
(MB)  vec RAM (MB)
    0.826         2.340  500000   100      50       32        100     7 bits    
86.54       5777.61         337.47             1          1859.34       
1831.055       366.211
   ```
   
   baseline:
   ```
   recall  latency (ms)    nDoc  topK  fanout  maxConn  beamWidth  quantized  
index s  index docs/s  force merge s  num segments  index size (MB)  vec disk 
(MB)  vec RAM (MB)
    0.828         2.680  500000   100      50       32        100     7 bits    
88.48       5650.74         357.45             1          1859.57       
1831.055       366.211
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to