jpountz commented on issue #11761:
URL: https://github.com/apache/lucene/issues/11761#issuecomment-1251264246

   I measured write amplification for the case tested in 
`TestTieredMergePolicy#testSimulateUpdates`:
   
   | Allowed percentage of deletes | Write amplification |
   | ------------- | ------------- |
   | 50 (max) | 4.34 |
   | 33 (default) | 4.34 |
   | 20 (min) | 4.68 |
   | 10 | 6.13 |
   | 5 | 8.76 |
   | 4 | 10.31 |
   | 3 | 12.97 |
   | 2 | 18.76 |
   | 1 | 37.89 |
   | 0 | 10779.78 |
   
   Assuming these numbers are representative, maybe we could allow users to 
configure values as low as 5% for the allowed percentage of deletes in their 
indexes, which translates to ~2x the write amplification of the default of 33% 
(8.76 vs. 4.34) according to the above numbers.
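
To make the ~2x figure easy to check, here is a small sketch that divides each measured value by the one for the 33% default. The numbers are copied from the table above; the dictionary and function names are only illustrative.

```python
# Write amplification measured for each allowed percentage of deletes,
# copied from the table above.
WRITE_AMPLIFICATION = {
    50: 4.34, 33: 4.34, 20: 4.68, 10: 6.13, 5: 8.76,
    4: 10.31, 3: 12.97, 2: 18.76, 1: 37.89, 0: 10779.78,
}


def relative_to_default(allowed_pct, default_pct=33):
    """Return the write amplification at allowed_pct as a multiple of the default's."""
    return WRITE_AMPLIFICATION[allowed_pct] / WRITE_AMPLIFICATION[default_pct]


# Print one line per threshold, from the most permissive to the strictest.
for pct in sorted(WRITE_AMPLIFICATION, reverse=True):
    print(f"{pct:>2}%: {relative_to_default(pct):.2f}x the default")
```

The ratio stays close to 1x down to 20%, reaches roughly 2x at 5%, and grows quickly below that.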
   
   For reference, to keep deletes under the threshold, `TieredMergePolicy` 
repeatedly runs the most balanced merge (with a small bias towards merges that 
reclaim more deletes) until the percentage of deleted documents in the index 
drops below the threshold.
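
The loop described above can be sketched as a toy model. This is not Lucene's implementation: the `Segment` class, the `score` formula, the `merge_factor` parameter and the way candidate merges are enumerated are all assumptions made for illustration. It only shows the shape of the algorithm: pick the best-scoring merge, apply it, and repeat until the share of deleted documents is at or below the threshold.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    live: int     # documents that are still live
    deleted: int  # deleted documents that still occupy space

    @property
    def total(self):
        return self.live + self.deleted


def deleted_pct(segments):
    total = sum(s.total for s in segments)
    return 100.0 * sum(s.deleted for s in segments) / total if total else 0.0


def score(window):
    """Lower is better. Hypothetical scoring, NOT Lucene's actual formula."""
    total = sum(s.total for s in window)
    skew = max(s.total for s in window) / total       # smallest when sizes are balanced
    live_ratio = sum(s.live for s in window) / total  # smaller when more deletes get reclaimed
    return skew * live_ratio ** 0.5                   # exponent < 1 keeps the deletes bias small


def reclaim_deletes(segments, allowed_pct, merge_factor=3):
    """Merge until deleted_pct <= allowed_pct; return (segments, docs_written)."""
    by_size = lambda s: s.total
    # Ignore empty segments so every candidate window has a non-zero size.
    segments = sorted((s for s in segments if s.total > 0), key=by_size, reverse=True)
    docs_written = 0
    while segments and deleted_pct(segments) > allowed_pct:
        k = min(merge_factor, len(segments))
        # Candidates (an assumption): every run of k size-adjacent segments.
        starts = range(len(segments) - k + 1)
        best = min(starts, key=lambda i: score(segments[i:i + k]))
        window = segments[best:best + k]
        merged = Segment(live=sum(s.live for s in window), deleted=0)
        docs_written += merged.live  # merging rewrites only the live documents
        segments = segments[:best] + segments[best + k:]
        if merged.live > 0:
            segments.append(merged)
        segments.sort(key=by_size, reverse=True)
    return segments, docs_written
```

For example, `reclaim_deletes([Segment(100, 0), Segment(50, 50), Segment(10, 40), Segment(10, 0)], allowed_pct=20, merge_factor=2)` starts at about 35% deletes and stops once the index is at or below 20%. In this model, a lower `allowed_pct` forces more iterations of the loop and therefore more rewritten documents, which is the write amplification measured in the table.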


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

