Don’t optimize. The first article isn’t as clear as it should be. The important 
sentence is "Unless you are running into resource problems, it’s best to leave 
merging alone.”

I’ve been running Solr in production since version 1.3, with several different 
kinds and sizes of collections. I’ve never run a daily optimize, even on 
collections that only change once per day.

The section titles "What? I can’t afford 50% “wasted” space” should have just 
been “Then don’t run Solr”. Really, you should have 100% free sapce, so a 22 Gb 
index would be on a volume with 22 Gb of free space.

It was a mistake to name it “optimize”. It should have been “force merge”.

wunder
Walter Underwood
wun...@wunderwood.org
http://observer.wunderwood.org/  (my blog)

> On Oct 2, 2018, at 6:04 AM, Jeff Courtade <courtadej...@gmail.com> wrote:
> 
> We run an old master/slave solr 4.3.0 solr cluster
> 
> 14 nodes 7/7
> indexes average 47/5 gig per shard around 2 mill docs per shard.
> 
> We have constant daily additions and a small amount of deletes.
> 
> We optimize nightly currently and it is a system hog.
> 
> Is it feasible to never run optimize?
> 
> I ask because it seems like it would be very bad not to but this
> information is out there apparently recommending exactly that... never
> optimizing.
> 
> https://lucidworks.com/2017/10/13/segment-merging-deleted-documents-optimize-may-bad/
> 
> https://lucidworks.com/2018/06/20/solr-and-optimizing-your-index-take-ii/
> 
> https://wiki.apache.org/solr/SolrPerformanceFactors#Optimization_Considerations

Reply via email to