cheng66551 commented on PR #14163:
URL: https://github.com/apache/lucene/pull/14163#issuecomment-2635560399

   > I don't think we should merge this change, but it's good that you were 
able to use it to confirm that merging would reclaim these deleted docs.
   > 
   > Can you add your data about this issue to #13226? There is a smell merging 
not keeping up or a bad interaction between the merge policy and soft deletes.
   
   @jpountz 
   When this PR was validated in our production environment, the docs.deleted 
count dropped from billions to millions, and the index storage size shrank 
fivefold, resulting in excellent performance.
   
   This PR is an opt-in tool: users decide whether to use it, and it does not 
affect the core Lucene merge process. I therefore believe it should be 
considered for merging.
   
   Regarding your request to add data about this issue to #13226: enabling 
InfoStream logging is quite cumbersome under the constraints of our production 
environment, so I have no logs more detailed than the segments information.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


