What do the Solr logs show happens on those servers when they go into recovery? What have you tried to do to diagnose the problem? You might review: http://wiki.apache.org/solr/UsingMailingLists
The first thing I'd check, though, is whether you're seeing large GC pauses that exceed the Zookeeper timeout, thus ZK thinks the replica is down and puts it into recovery. YOu can get this info by tracking the GC cycles as here: https://lucidworks.com/blog/garbage-collection-bootcamp-1-0/, the section "getting a view into garbage collection" Best, Erick On Tue, Mar 24, 2015 at 5:57 AM, Gopal Jee <gopal....@myntra.com> wrote: > Hi > We have a large solrcloud cluster. We have observed that during heavy > indexing, large number of replicas go to recovering or down state. > What could be the possible reason and/or fix for the issue. > > Gopal