ChrisHegarty commented on PR #14980:
URL: https://github.com/apache/lucene/pull/14980#issuecomment-3155502316

   Thanks @mccullocht - not sure why you're not seeing improvement on 
Graviton2, but I'll post some more results that I see when testing across 
different platforms.
   
   Search latencies have improved by ~33%, and merge time 40-50%.  I some ideas 
about how to further improve indexing, but they can be done separately. 
   
   1M cohere 768d
   
   linux-x64 (m6i.2xlarge, x64, Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz, 
AVX 512)
   ```
   recall  latency(ms)  netCPU  avgCpuCount     nDoc  topK  fanout  maxConn  
beamWidth  quantized  index(s)  index_docs/s  force_merge(s)  num_segments  
index_size(MB)  vec_disk(MB)  vec_RAM(MB)  indexType
   baseline
   0.943        3.445   3.440        0.999  1000000   100      50       64      
  250         no   1303.77        767.00          582.92             1         
3022.19      2929.688     2929.688       HNSW
   candidate
   0.942        2.492   2.477        0.994  1000000   100      50       64      
  250         no    982.26       1018.06          325.12             1         
3020.78      2929.688     2929.688       HNSW
   ```
   
   linux-amd64 (m6a.4xlarge, AMD EPYC 7R13 Processor, AVX2 )
   ```
   recall  latency(ms)  netCPU  avgCpuCount     nDoc  topK  fanout  maxConn  
beamWidth  quantized  index(s)  index_docs/s  force_merge(s)  num_segments  
index_size(MB)  vec_disk(MB)  vec_RAM(MB)  indexType
   baseline
   0.943        3.276   3.256        0.994  1000000   100      50       64      
  250         no   1246.36        802.34          670.54             1         
3022.99      2929.688     2929.688       HNSW
   candidate
   0.944        2.003   1.989        0.993  1000000   100      50       64      
  250         no   1489.27        671.47          352.79             1         
3023.21      2929.688     2929.688       HNSW
   ```
   
   linux-arm (c6g.8xlarge, aarch64, Neoverse-N1)
   ```
   recall  latency(ms)  netCPU  avgCpuCount     nDoc  topK  fanout  maxConn  
beamWidth  quantized  index(s)  index_docs/s  force_merge(s)  num_segments  
index_size(MB)  vec_disk(MB)  vec_RAM(MB)  indexType
   baseline
   0.927        4.503   4.493        0.998  1000000   100      50       32      
  250         no    881.79       1134.06          766.93             1         
3014.53      2929.688     2929.688       HNSW
   candidate
   0.927        3.015   2.999        0.995  1000000   100      50       32      
  250         no    873.39       1144.96          322.33             1         
3013.33      2929.688     2929.688       HNSW
   ```
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to