dan2097 commented on issue #11761:
URL: https://github.com/apache/lucene/issues/11761#issuecomment-1255309927

   I have also ran into this on our patent search system. In our index the 
problem is exagerrated by the larger documents tending to be more frequently 
reindexed so the 20% deleted documents can translate to 40% of the overall 
index size!
   For my use case 5% would be a massive improvement.
   
   I ccan definitely imagine that for a system where indexing is light and 
infrequent 2% may make sense to ensure optimal perfomance/disk usage, without 
requiring the explicit use expungeDeletes. Having said that 5% is definitely 
low enough for my use case.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to