cheng66551 commented on PR #14163: URL: https://github.com/apache/lucene/pull/14163#issuecomment-2635558626
> I don't think we should merge this change, but it's good that you were able to use it to confirm that merging would reclaim these deleted docs. > > Can you add your data about this issue to #13226? There is a smell merging not keeping up or a bad interaction between the merge policy and soft deletes. @jpountz During the production environment validation of this PR, the number of docs.deleted documents was reduced from billions to millions, and the index storage size was decreased by five times, leading to very positive results. This PR serves as a tool, allowing users to independently decide whether to use it without affecting the underlying Lucene processes. Therefore, I believe this PR can be considered for merging. Regarding your request, "Can you add your data about this issue to #13226," due to the constraints of the production environment, enabling InfoStream logging is a complicated task. As a result, there are no more detailed logs available beyond the segments information. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org