After the merge. That is what merges do, clean up segments.

I expect it is very rare for a segment to be 100% deleted docs, so it isn’t
worth handling that case.

wunder
Walter Underwood
wun...@wunderwood.org
http://observer.wunderwood.org/  (my blog)


> On Oct 28, 2016, at 5:54 PM, Alexandre Rafalovitch <arafa...@gmail.com> wrote:
> 
> Don't the segment that only has deleted documents just gets dropped?
> Or does it get dropped _after_ the merge and therefore still sits
> around?
> 
> Regards,
>   Alex.
> ----
> Solr Example reading group is starting November 2016, join us at
> http://j.mp/SolrERG
> Newsletter and resources for Solr beginners and intermediates:
> http://www.solr-start.com/
> 
> 
> On 29 October 2016 at 08:53, Walter Underwood <wun...@wunderwood.org> wrote:
>> It is normal for disk usage to double. Under controlled circumstances,
>> it can triple, but that probably won’t happen.
>> 
>> This is the second time today that I’ve sent this information to the list.
>> 
>> It can use nearly 2X the space whenever the largest segment(s) are
>> merged, especially if there are only a few smaller segments.
>> 
>> In order to use 3X the space, you need to:
>> 
>> 1. Disable merging.
>> 2. Delete all the documents.
>> 3. Add all the documents.
>> 4. Enable merging.
>> 
>> This causes one complete set of segments that are 100% deletes,
>> one set that is 0% deletes, then the merge creates another set that
>> is 0% deletes. During the merge, the old files remain while the
>> new one is created.
>> 
>> wunder
>> Walter Underwood
>> wun...@wunderwood.org
>> http://observer.wunderwood.org/  (my blog)
>> 
>> 
>>> On Oct 28, 2016, at 2:41 PM, Alexandre Rafalovitch <arafa...@gmail.com> 
>>> wrote:
>>> 
>>> 2) Is probably a merge operation. Lucene index segments are not
>>> rewritable in place, so the merge creates a new file, does everything
>>> to it, then switches to it.
>>> 
>>> I remember the number was that the space could temporarily triple
>>> (?!?) though that may have been before the tiered merge policy.
>>> 
>>> 3) It should be safe to delete old log files. It is standard log4j stuff.
>>> 
>>> ----
>>> Solr Example reading group is starting November 2016, join us at
>>> http://j.mp/SolrERG
>>> Newsletter and resources for Solr beginners and intermediates:
>>> http://www.solr-start.com/
>>> 
>>> 
>>> On 29 October 2016 at 06:55, Jamal, Sarfaraz
>>> <sarfaraz.ja...@verizonwireless.com.invalid> wrote:
>>>> Hi Guys,
>>>> 
>>>> I am currently investigating an instance of Solr's Disk space usage and I 
>>>> had a few questions I thought you guys might be able to help answer.
>>>> 
>>>> First Question
>>>> * There is 30 gb's worth of autosuggest data in the /tmp folder. Each file 
>>>> is half of a gigabyte
>>>> Is it safe to delete those files?
>>>> 
>>>> Second Question
>>>> Also, we notice that at times the disk runs down to only having a few 
>>>> gigabytes available, and then goes back to having more space. (the index 
>>>> file literally grows and then shrinks).
>>>> 
>>>> Third Question
>>>> Is it also safe to delete the log files?
>>>> 
>>>> We run a database indexer on a set interval, perhaps that is relevant to 
>>>> this discussion.
>>>> 
>>>> Sas
>> 

Reply via email to