tteofili commented on PR #14094:
URL: https://github.com/apache/lucene/pull/14094#issuecomment-2740827581

   additional experiments with different quantization levels and filtering:
   
   ## No-fitlering
   
   ### Baseline
   ```
   recall  latency(ms)    nDoc  topK  fanout  maxConn  beamWidth  quantized  
index(s)  index_docs/s  num_segments  index_size(MB)  vec_disk(MB)  vec_RAM(MB) 
 indexType
    0.985        4.620  200000   100      50       64        250         no    
106.46       1878.64             3          600.08       585.938      585.938   
    HNSW
    0.899        3.657  200000   100      50       64        250     7 bits     
67.74       2952.47             5          746.34       733.185      147.247    
   HNSW
    0.585        2.328  200000   100      50       64        250     4 bits     
46.86       4268.03             3          675.33       659.943       74.005    
   HNSW
    0.983        9.212  500000   100      50       64        250         no    
235.68       2121.56             8         1501.44      1464.844     1464.844   
    HNSW
    0.900        7.562  500000   100      50       64        250     7 bits    
165.99       3012.30             9         1867.29      1832.962      368.118   
    HNSW
    0.580        4.934  500000   100      50       64        250     4 bits    
130.65       3826.96             8         1689.29      1649.857      185.013   
    HNSW
   ```
   ### Candidate
   ```
   recall  latency(ms)    nDoc  topK  fanout  maxConn  beamWidth  quantized  
visited  index(s)  index_docs/s  num_segments  index_size(MB)  selectivity  
vec_disk(MB)  vec_RAM(MB)  indexType
    0.980        3.744  200000   100      50       64        250         no    
10690    106.82       1872.29             3          600.10         1.00       
585.938      585.938       HNSW
    0.896        3.473  200000   100      50       64        250     7 bits    
11878     68.83       2905.54             5          746.39         1.00       
733.185      147.247       HNSW
    0.585        2.032  200000   100      50       64        250     4 bits    
13279     51.32       3897.12             3          675.32         1.00       
659.943       74.005       HNSW
    0.982        8.549  500000   100      50       64        250         no    
23079    248.29       2013.81             8         1501.32         1.00      
1464.844     1464.844       HNSW
    0.898        6.733  500000   100      50       64        250     7 bits    
23629    167.17       2991.02             9         1867.31         1.00      
1832.962      368.118       HNSW
    0.581        3.776  500000   100      50       64        250     4 bits    
21179    152.43       3280.24             5         1690.38         1.00      
1649.857      185.013       HNSW
   ```
   
   ## Filtering 
   
   ### Baseline
   ```
   recall  latency(ms)    nDoc  topK  fanout  maxConn  beamWidth  quantized  
visited  index(s)  index_docs/s  num_segments  index_size(MB)  selectivity  
vec_disk(MB)  vec_RAM(MB)  indexType
    1.000        0.642  200000   100      50       64        250         no     
1965    109.81       1821.26             3          600.16         0.01       
585.938      585.938       HNSW
    0.964        4.947  200000   100      50       64        250         no     
9504    110.91       1803.33             3          600.11         0.10       
585.938      585.938       HNSW
    0.983        8.417  200000   100      50       64        250         no    
22193    103.13       1939.28             3          600.09         0.50       
585.938      585.938       HNSW
    0.918        0.762  200000   100      50       64        250     7 bits     
1981     64.33       3108.82             5          746.33         0.01       
733.185      147.247       HNSW
    0.892        4.310  200000   100      50       64        250     7 bits    
10302     66.23       3019.87             5          746.34         0.10       
733.185      147.247       HNSW
    0.898        6.900  200000   100      50       64        250     7 bits    
23394     69.09       2894.82             4          746.51         0.50       
733.185      147.247       HNSW
    0.660        1.137  200000   100      50       64        250     4 bits     
1695     50.01       3999.44             3          675.40         0.01       
659.943       74.005       HNSW
    0.619        2.852  200000   100      50       64        250     4 bits    
11021     49.88       4010.03             3          675.31         0.10       
659.943       74.005       HNSW
    0.592        4.429  200000   100      50       64        250     4 bits    
27121     48.72       4104.75             3          675.30         0.50       
659.943       74.005       HNSW
    1.000        2.371  500000   100      50       64        250         no     
5017    244.18       2047.64             8         1501.36         0.01      
1464.844     1464.844       HNSW
    0.968       11.976  500000   100      50       64        250         no    
21270    266.14       1878.73             8         1501.19         0.10      
1464.844     1464.844       HNSW
    0.987       17.191  500000   100      50       64        250         no    
44939    239.83       2084.78             8         1501.26         0.50      
1464.844     1464.844       HNSW
    0.913        2.024  500000   100      50       64        250     7 bits     
5075    166.55       3002.17             9         1867.19         0.01      
1832.962      368.118       HNSW
    0.891       10.079  500000   100      50       64        250     7 bits    
21671    168.88       2960.73             9         1867.41         0.10      
1832.962      368.118       HNSW
    0.899       13.733  500000   100      50       64        250     7 bits    
47517    168.22       2972.25             9         1867.22         0.50      
1832.962      368.118       HNSW
    0.660        1.183  500000   100      50       64        250     4 bits     
5085    153.22       3263.30             5         1690.35         0.01      
1649.857      185.013       HNSW
    0.598        8.365  500000   100      50       64        250     4 bits    
23514    137.45       3637.69             8         1689.26         0.10      
1649.857      185.013       HNSW
    0.588        9.584  500000   100      50       64        250     4 bits    
48507    137.44       3638.00             8         1689.32         0.50      
1649.857      185.013       HNSW
   ```
   
   ### Candidate
   ```
   recall  latency(ms)    nDoc  topK  fanout  maxConn  beamWidth  quantized  
visited  index(s)  index_docs/s  num_segments  index_size(MB)  selectivity  
vec_disk(MB)  vec_RAM(MB)  indexType
    1.000        0.618  200000   100      50       64        250         no     
1685    105.74       1891.47             3          600.11         0.01       
585.938      585.938       HNSW
    0.955        4.211  200000   100      50       64        250         no     
8446    104.30       1917.60             3          600.09         0.10       
585.938      585.938       HNSW
    0.970        6.499  200000   100      50       64        250         no    
17121    106.95       1869.98             3          600.11         0.50       
585.938      585.938       HNSW
    0.918        0.813  200000   100      50       64        250     7 bits     
2047     69.00       2898.68             5          746.34         0.01       
733.185      147.247       HNSW
    0.883        4.271  200000   100      50       64        250     7 bits     
8909     70.60       2832.98             4          746.46         0.10       
733.185      147.247       HNSW
    0.893        6.104  200000   100      50       64        250     7 bits    
21460     69.16       2891.72             5          746.39         0.50       
733.185      147.247       HNSW
    0.684        0.763  200000   100      50       64        250     4 bits     
1969     49.21       4064.54             3          675.34         0.01       
659.943       74.005       HNSW
    0.613        2.752  200000   100      50       64        250     4 bits     
9832     50.25       3979.78             3          675.31         0.10       
659.943       74.005       HNSW
    0.592        3.430  200000   100      50       64        250     4 bits    
20823     48.60       4115.06             3          675.33         0.50       
659.943       74.005       HNSW
    1.000        2.346  500000   100      50       64        250         no     
4996    243.49       2053.51             8         1501.29         0.01      
1464.844     1464.844       HNSW
    0.964       11.287  500000   100      50       64        250         no    
19991    243.30       2055.08             8         1501.34         0.10      
1464.844     1464.844       HNSW
    0.984       15.180  500000   100      50       64        250         no    
39049    245.65       2035.38             8         1501.41         0.50      
1464.844     1464.844       HNSW
    0.894        2.064  500000   100      50       64        250     7 bits     
4615    175.74       2845.05             9         1867.25         0.01      
1832.962      368.118       HNSW
    0.889        9.321  500000   100      50       64        250     7 bits    
20292    176.89       2826.68             9         1867.15         0.10      
1832.962      368.118       HNSW
    0.898       13.142  500000   100      50       64        250     7 bits    
43073    167.55       2984.20             9         1867.34         0.50      
1832.962      368.118       HNSW
    0.654        1.819  500000   100      50       64        250     4 bits     
5024    151.40       3302.55             5         1690.48         0.01      
1649.857      185.013       HNSW
    0.598        5.857  500000   100      50       64        250     4 bits    
19382    155.89       3207.37             5         1690.41         0.10      
1649.857      185.013       HNSW
    0.588        5.437  500000   100      50       64        250     4 bits    
29505    150.84       3314.77             5         1690.41         0.50      
1649.857      185.013       HNSW
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to