tteofili commented on PR #14094: URL: https://github.com/apache/lucene/pull/14094#issuecomment-2740827581
Additional experiments with different quantization levels and filtering:

## No-filtering

### Baseline

```
recall  latency(ms)    nDoc  topK  fanout  maxConn  beamWidth  quantized  index(s)  index_docs/s  num_segments  index_size(MB)  vec_disk(MB)  vec_RAM(MB)  indexType
 0.985        4.620  200000   100      50       64        250         no    106.46       1878.64             3          600.08       585.938      585.938       HNSW
 0.899        3.657  200000   100      50       64        250     7 bits     67.74       2952.47             5          746.34       733.185      147.247       HNSW
 0.585        2.328  200000   100      50       64        250     4 bits     46.86       4268.03             3          675.33       659.943       74.005       HNSW
 0.983        9.212  500000   100      50       64        250         no    235.68       2121.56             8         1501.44      1464.844     1464.844       HNSW
 0.900        7.562  500000   100      50       64        250     7 bits    165.99       3012.30             9         1867.29      1832.962      368.118       HNSW
 0.580        4.934  500000   100      50       64        250     4 bits    130.65       3826.96             8         1689.29      1649.857      185.013       HNSW
```

### Candidate

```
recall  latency(ms)    nDoc  topK  fanout  maxConn  beamWidth  quantized  visited  index(s)  index_docs/s  num_segments  index_size(MB)  selectivity  vec_disk(MB)  vec_RAM(MB)  indexType
 0.980        3.744  200000   100      50       64        250         no    10690    106.82       1872.29             3          600.10         1.00       585.938      585.938       HNSW
 0.896        3.473  200000   100      50       64        250     7 bits    11878     68.83       2905.54             5          746.39         1.00       733.185      147.247       HNSW
 0.585        2.032  200000   100      50       64        250     4 bits    13279     51.32       3897.12             3          675.32         1.00       659.943       74.005       HNSW
 0.982        8.549  500000   100      50       64        250         no    23079    248.29       2013.81             8         1501.32         1.00      1464.844     1464.844       HNSW
 0.898        6.733  500000   100      50       64        250     7 bits    23629    167.17       2991.02             9         1867.31         1.00      1832.962      368.118       HNSW
 0.581        3.776  500000   100      50       64        250     4 bits    21179    152.43       3280.24             5         1690.38         1.00      1649.857      185.013       HNSW
```

## Filtering

### Baseline

```
recall  latency(ms)    nDoc  topK  fanout  maxConn  beamWidth  quantized  visited  index(s)  index_docs/s  num_segments  index_size(MB)  selectivity  vec_disk(MB)  vec_RAM(MB)  indexType
 1.000        0.642  200000   100      50       64        250         no     1965    109.81       1821.26             3          600.16         0.01       585.938      585.938       HNSW
 0.964        4.947  200000   100      50       64        250         no     9504    110.91       1803.33             3          600.11         0.10       585.938      585.938       HNSW
 0.983        8.417  200000   100      50       64        250         no    22193    103.13       1939.28             3          600.09         0.50       585.938      585.938       HNSW
 0.918        0.762  200000   100      50       64        250     7 bits     1981     64.33       3108.82             5          746.33         0.01       733.185      147.247       HNSW
 0.892        4.310  200000   100      50       64        250     7 bits    10302     66.23       3019.87             5          746.34         0.10       733.185      147.247       HNSW
 0.898        6.900  200000   100      50       64        250     7 bits    23394     69.09       2894.82             4          746.51         0.50       733.185      147.247       HNSW
 0.660        1.137  200000   100      50       64        250     4 bits     1695     50.01       3999.44             3          675.40         0.01       659.943       74.005       HNSW
 0.619        2.852  200000   100      50       64        250     4 bits    11021     49.88       4010.03             3          675.31         0.10       659.943       74.005       HNSW
 0.592        4.429  200000   100      50       64        250     4 bits    27121     48.72       4104.75             3          675.30         0.50       659.943       74.005       HNSW
 1.000        2.371  500000   100      50       64        250         no     5017    244.18       2047.64             8         1501.36         0.01      1464.844     1464.844       HNSW
 0.968       11.976  500000   100      50       64        250         no    21270    266.14       1878.73             8         1501.19         0.10      1464.844     1464.844       HNSW
 0.987       17.191  500000   100      50       64        250         no    44939    239.83       2084.78             8         1501.26         0.50      1464.844     1464.844       HNSW
 0.913        2.024  500000   100      50       64        250     7 bits     5075    166.55       3002.17             9         1867.19         0.01      1832.962      368.118       HNSW
 0.891       10.079  500000   100      50       64        250     7 bits    21671    168.88       2960.73             9         1867.41         0.10      1832.962      368.118       HNSW
 0.899       13.733  500000   100      50       64        250     7 bits    47517    168.22       2972.25             9         1867.22         0.50      1832.962      368.118       HNSW
 0.660        1.183  500000   100      50       64        250     4 bits     5085    153.22       3263.30             5         1690.35         0.01      1649.857      185.013       HNSW
 0.598        8.365  500000   100      50       64        250     4 bits    23514    137.45       3637.69             8         1689.26         0.10      1649.857      185.013       HNSW
 0.588        9.584  500000   100      50       64        250     4 bits    48507    137.44       3638.00             8         1689.32         0.50      1649.857      185.013       HNSW
```

### Candidate

```
recall  latency(ms)    nDoc  topK  fanout  maxConn  beamWidth  quantized  visited  index(s)  index_docs/s  num_segments  index_size(MB)  selectivity  vec_disk(MB)  vec_RAM(MB)  indexType
 1.000        0.618  200000   100      50       64        250         no     1685    105.74       1891.47             3          600.11         0.01       585.938      585.938       HNSW
 0.955        4.211  200000   100      50       64        250         no     8446    104.30       1917.60             3          600.09         0.10       585.938      585.938       HNSW
 0.970        6.499  200000   100      50       64        250         no    17121    106.95       1869.98             3          600.11         0.50       585.938      585.938       HNSW
 0.918        0.813  200000   100      50       64        250     7 bits     2047     69.00       2898.68             5          746.34         0.01       733.185      147.247       HNSW
 0.883        4.271  200000   100      50       64        250     7 bits     8909     70.60       2832.98             4          746.46         0.10       733.185      147.247       HNSW
 0.893        6.104  200000   100      50       64        250     7 bits    21460     69.16       2891.72             5          746.39         0.50       733.185      147.247       HNSW
 0.684        0.763  200000   100      50       64        250     4 bits     1969     49.21       4064.54             3          675.34         0.01       659.943       74.005       HNSW
 0.613        2.752  200000   100      50       64        250     4 bits     9832     50.25       3979.78             3          675.31         0.10       659.943       74.005       HNSW
 0.592        3.430  200000   100      50       64        250     4 bits    20823     48.60       4115.06             3          675.33         0.50       659.943       74.005       HNSW
 1.000        2.346  500000   100      50       64        250         no     4996    243.49       2053.51             8         1501.29         0.01      1464.844     1464.844       HNSW
 0.964       11.287  500000   100      50       64        250         no    19991    243.30       2055.08             8         1501.34         0.10      1464.844     1464.844       HNSW
 0.984       15.180  500000   100      50       64        250         no    39049    245.65       2035.38             8         1501.41         0.50      1464.844     1464.844       HNSW
 0.894        2.064  500000   100      50       64        250     7 bits     4615    175.74       2845.05             9         1867.25         0.01      1832.962      368.118       HNSW
 0.889        9.321  500000   100      50       64        250     7 bits    20292    176.89       2826.68             9         1867.15         0.10      1832.962      368.118       HNSW
 0.898       13.142  500000   100      50       64        250     7 bits    43073    167.55       2984.20             9         1867.34         0.50      1832.962      368.118       HNSW
 0.654        1.819  500000   100      50       64        250     4 bits     5024    151.40       3302.55             5         1690.48         0.01      1649.857      185.013       HNSW
 0.598        5.857  500000   100      50       64        250     4 bits    19382    155.89       3207.37             5         1690.41         0.10      1649.857      185.013       HNSW
 0.588        5.437  500000   100      50       64        250     4 bits    29505    150.84       3314.77             5         1690.41         0.50      1649.857      185.013       HNSW
```

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at: us...@infra.apache.org