through a clever bit of reflection, you can set the
reclaimDeletesWeight variable from solrconfig by including something
like
<double name="reclaimDeletesWeight">5</double> (going from memory
here, you'll get an error on startup if I've messed it up.....)

That may help..

Best,
Erick


On Wed, Mar 30, 2016 at 6:15 AM, David Santamauro
<david.santama...@gmail.com> wrote:
>
>
> On 03/30/2016 08:23 AM, Jostein Elvaker Haande wrote:
>>
>> On 30 March 2016 at 12:25, Markus Jelsma <markus.jel...@openindex.io>
>> wrote:
>>>
>>> Hello - with TieredMergePolicy and default reclaimDeletesWeight of 2.0,
>>> and frequent updates, it is not uncommon to see a ratio of 25%. If you want
>>> deletes to be reclaimed more often, e.g. weight of 4.0, you will see very
>>> frequent merging of large segments, killing performance if you are on
>>> spinning disks.
>>
>>
>> Most of our installations are on spinning disks, so if I want a more
>> aggressive reclaim, this will impact performance. This is of course
>> something that I do not desire, so I'm wondering if scheduling a
>> commit with 'expungeDeletes' during off peak business hours is a
>> better approach than setting up a more aggressive merge policy.
>>
>
> As far as my experimentation with @expungeDeletes goes, if the data you
> indexed and committed using @expungeDeletes didn't touch segments with any
> deleted documents nor wasn't enough data to cause merging with a segment
> containing deleted documents, no deleted documents will be removed.
> Basically, @expungeDeletes expunges deletes in segments affected by the
> commit. If you have a large update that touches many segments containing
> deleted documents and you use @expungeDeletes, it could be just as resource
> intensive as an optimize.
>
> My setting for reclaimDeletesWeight:
>   <double name="reclaimDeletesWeight">5.0</double>
>
> It keeps the deleted documents down to ~ 10% without any noticable impact on
> resources or performance. But I'm still in the testing phase with this
> setting.
>

Reply via email to