benwtrent commented on PR #14304: URL: https://github.com/apache/lucene/pull/14304#issuecomment-2697969649
I compared this branch with main. There are measurable improvements, but the quantization step isn't the main bottle neck. Vector comparisons still dominate the costs. But, its a nice bump I would say. candidate: ``` recall latency (ms) nDoc topK fanout maxConn beamWidth quantized index s index docs/s force merge s num segments index size (MB) vec disk (MB) vec RAM (MB) 0.826 2.340 500000 100 50 32 100 7 bits 86.54 5777.61 337.47 1 1859.34 1831.055 366.211 ``` baseline: ``` recall latency (ms) nDoc topK fanout maxConn beamWidth quantized index s index docs/s force merge s num segments index size (MB) vec disk (MB) vec RAM (MB) 0.828 2.680 500000 100 50 32 100 7 bits 88.48 5650.74 357.45 1 1859.57 1831.055 366.211 ``` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org