Re: [PR] Introduces IndexInput#updateReadAdvice to change the ReadAdvice while merging vectors [lucene]

via GitHub Mon, 18 Nov 2024 12:13:20 -0800


shatejas commented on PR #13985:
URL: https://github.com/apache/lucene/pull/13985#issuecomment-2484035773


   ### Benchmarks
   
   #### Setup 1 - Opensearch cluster
   
   Ran with [opensearch 
benchmarks](https://github.com/opensearch-project/opensearch-benchmark)
   
   Total data nodes - 3
   Total shards - 6 (2 per node), no replicas
   Memory - 128gb
   vCPU - 16
   
   Dataset used: cohere-10m
   
   Baseline - OS 2.18 and lucene 9.12
   Candidate - OS 2.16 and lucene [9.12 with readAdvice 
changes](https://github.com/apache/lucene/compare/branch_9_12...shatejas:lucene:branch_9_12)
   
   **Why was this tested with lucene 9.12?**
   Opensearch is not using lucene >9.12 for any of its version. Upgrading it to 
use lucene 10 requires significant changes. For candidate, required commits 
were cherry-picked
   
   
   Run 1: sequence of operations: delete-index -> create-index -> add documents 
-> force-merge -> search
   
   ##### Results
   |           | Force-merge(ms)   | Force-merge(hrs) | Search p50 | Search p90 
| Search p99 |
   
|-----------|-------------------|------------------|------------|------------|------------|
   | Baseline  | 15795889.88313920 | 4hrs 23 mins     | 9.6        | 10.8       
| 14.7       |
   | Candidate | 15204143.95724240 | 4hrs 13mins      | 10.7       | 12.0       
| 15.0       |
   
   Run 2:  Search performed on already indexed data from above run
   
   |           | Search p50 | Search p90 | Search p99 |
   |-----------|------------|------------|------------|
   | Baseline  | 9.7        | 10.6       | 12.1       |
   | Candidate | 10.4       | 11.3       | 12.5       |
   
   #### Setup 2: Used lucene-utils knnPerfTest.py
   
   Baseline - Lucene main
   Candidate - Lucene main with current commit
   
   **Baseline**
   
   | recall |  latency (ms)  | nDoc | topK  |  fanout   | maxConn |   beamWidth 
|   quantized  |  index s   | index docs/s  |  force merge s   | num segments  
|  index size (MB) | 
   
|-----------|------------|------------|------------|-----------|------------|------------|------------|-----------|------------|------------|------------|------------|
   | 0.644  |       0.428 | 50000 |   10 |     64 |      64  |      250  |      
 no |   18.97  |     2635.18  |         1.89   |          1    |        20.62 |
   
   **Candidate**
   
   | recall |  latency (ms)  | nDoc | topK  |  fanout   | maxConn |   beamWidth 
|   quantized  |  index s   | index docs/s  |  force merge s   | num segments  
|  index size (MB) | 
   
|-----------|------------|------------|------------|-----------|------------|------------|------------|-----------|------------|------------|------------|------------|
   |  0.644    |      0.436 |   50000  |   10    |   64   |     64      |   250 
   |      no  |   20.20    |    2474.76    |        1.77  |            1   |    
      20.62 | 
   
   There is a small affect on search latencies, its hard to say if its due to 
the change or just a fluctuation in the runs. I couldn't think of a reason that 
would of search latencies
   
   @jpountz @ChrisHegarty thoughts?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Re: [PR] Introduces IndexInput#updateReadAdvice to change the ReadAdvice while merging vectors [lucene]

Reply via email to