On Fri, Sep 18, 2009 at 1:06 AM, Jibo John <jiboj...@mac.com> wrote:

> Hello,
>
> Came across a lucene patch (
> http://issues.apache.org/jira/browse/LUCENE-1634) that would consider the
> number of deleted documents as the criteria when deciding which segments to
> merge.
>
> Since we expect to have very frequent deletes, we hope this would help
> reclaim the space consumed by the deleted documents in a much more efficient
> way.
>
> Currently, we can specify a mergepolicy in solrconfig.xml like this:
>
> <!--<mergePolicy>org.apache.lucene.index.LogByteSizeMergePolicy</mergePolicy>-->
>
> However, by default, calibrateSizeByDeletes = false in LogMergePolicy.
>
> I was wondering if there is a way I can modify calibrateSizeByDeletes just
> by configuration ?
>
Alas, no. The only option that I see for you is to sub-class LogByteSizeMergePolicy and set calibrateSizeByDeletes to true in the constructor. However, please open a Jira issue so we don't forget about it.

Also, you might be interested in expungeDeletes, which has been added as a request parameter for commits. Calling commit with expungeDeletes=true will remove all deleted documents from the index but, unlike an optimize, it won't always reduce the index to a single segment.

--
Regards,
Shalin Shekhar Mangar.
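
For illustration, a commit with expungeDeletes could be sent from plain Java roughly as follows. This is only a minimal sketch and not something from the thread itself: the update URL http://localhost:8983/solr/update is an assumption (adjust host, port and core path to your setup), and the class and method names (ExpungeDeletesCommit, commitMessage, post) are made up for the example. It posts the XML update message <commit expungeDeletes="true"/> to the update handler.

```java
import java.io.OutputStream;
import java.net.HttpURLConnection;
import java.net.URI;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class, not part of Solr or Lucene.
public class ExpungeDeletesCommit {

    // Builds the XML update message for a commit, with expungeDeletes set as requested.
    public static String commitMessage(boolean expungeDeletes) {
        return "<commit expungeDeletes=\"" + expungeDeletes + "\"/>";
    }

    // POSTs the XML message to the given update URL and returns the HTTP status code.
    public static int post(String updateUrl, String xml) throws Exception {
        HttpURLConnection conn =
                (HttpURLConnection) URI.create(updateUrl).toURL().openConnection();
        conn.setRequestMethod("POST");
        conn.setDoOutput(true);
        conn.setRequestProperty("Content-Type", "text/xml; charset=UTF-8");
        try (OutputStream out = conn.getOutputStream()) {
            out.write(xml.getBytes(StandardCharsets.UTF_8));
        }
        return conn.getResponseCode();
    }

    public static void main(String[] args) throws Exception {
        // Assumed default location of the update handler; pass your own URL as the first argument.
        String url = args.length > 0 ? args[0] : "http://localhost:8983/solr/update";
        System.out.println("HTTP " + post(url, commitMessage(true)));
    }
}
```

Because expungeDeletes only merges away segments that contain deletions, this is typically cheaper than a full optimize, but it can still be a heavy operation on a large index, so it is probably best issued occasionally rather than on every commit.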