Pulkitg64 commented on PR #15003: URL: https://github.com/apache/lucene/pull/15003#issuecomment-3183949385
Adding some KnnPerfTestResults where I tried to simulate deletes while indexing docs. We are seeing consistent improvement in Indexing Time and Indexing Rate (except one weird case when we deleted 40% docs) without impacting recall. Num Docs: 1MM Max-Conn: 32 Beam-Width: 250 Quantize Bits: 32 Topk: 100 | Experiment | Baseline | | |Candidate | | | % Change | | |-----------|--------|---------------|---------------|--------|---------------|---------------|---------------|---------------| | % Deletes | Recall | Indexing Time (s) | Indexing Rate (docs/s) | Recall | Indexing Time (s) | Indexing Rate (docs/s) | Indexing Time | Indexing Rate | | 25 | 0.952 | 692 | 1443 | 0.955 | 576 | 1734 | -20% | 17% | | 30 | 0.952 | 581 | 1719 | 0.958 | 517 | 1932 | -12% | 11% | | 40 | 0.951 | 560 | 1782 | 0.945 | 553 | 1805 | -1% | 1% | | 50 | 0.96 | 446 | 2241 | 0.953 | 421 | 2371 | -6% | 5% | | 60 | 0.974 | 234 | 4265 | 0.972 | 208 | 4804 | -13% | 11% | -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org