msokolov commented on PR #15003: URL: https://github.com/apache/lucene/pull/15003#issuecomment-3188897846
It's fascinating that we actually see recall *improving* in many cases! Intuitively, I think that when we merge more segments in, we have an opportunity to patch up the holes left by the deleted docs, and maybe we somehow end up doing that in an even better way the second time around? I do wonder what recall will look like for graphs with high deletion rates that are only ever singleton-merged. Could we test that with `luceneutil` by creating a single-segment index (with force-merge), deleting 50% of the docs, and then force-merging again?
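For the experiment proposed above, recall would need to be measured against exact neighbors computed over the surviving (non-deleted) docs only. Here is a minimal sketch of that bookkeeping in plain Python. It is not `luceneutil`'s code: the helper names (`pick_deletions`, `exact_top_k`, `recall_at_k`) are hypothetical, dot product is assumed as the similarity, and the approximate result list is assumed to come from a KNN search against the re-merged index.

```python
import random


def dot(a, b):
    """Dot-product similarity (an assumption; the real index may use another metric)."""
    return sum(x * y for x, y in zip(a, b))


def pick_deletions(num_docs, fraction, seed=0):
    """Choose a reproducible random subset of doc ids to delete."""
    rng = random.Random(seed)
    return set(rng.sample(range(num_docs), int(num_docs * fraction)))


def exact_top_k(vectors, live_ids, query, k):
    """Brute-force top-k by similarity, restricted to live (non-deleted) docs."""
    ranked = sorted(live_ids, key=lambda i: dot(vectors[i], query), reverse=True)
    return ranked[:k]


def recall_at_k(approx_ids, exact_ids):
    """Fraction of the exact top-k that the approximate search also returned."""
    if not exact_ids:
        return 0.0
    return len(set(approx_ids) & set(exact_ids)) / len(exact_ids)
```

A run would pick deletions with `pick_deletions(num_docs, 0.5)`, delete those docs and force-merge the index, then average `recall_at_k(approx, exact_top_k(vectors, live_ids, query, k))` over the query set, where `approx` holds the ids returned by the graph search.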