What do the Solr logs show happens on those servers when they go into
recovery? What have you tried to do to diagnose the problem? You might
review: http://wiki.apache.org/solr/UsingMailingLists

The first thing I'd check, though, is whether you're seeing large GC
pauses that exceed the Zookeeper timeout, thus ZK thinks the replica
is down and puts it into recovery. YOu can get this info by tracking
the GC cycles as here:
https://lucidworks.com/blog/garbage-collection-bootcamp-1-0/, the
section "getting a view into garbage collection"

Best,
Erick

On Tue, Mar 24, 2015 at 5:57 AM, Gopal Jee <gopal....@myntra.com> wrote:
> Hi
> We have a large solrcloud cluster. We have observed that during heavy
> indexing, large number of replicas go to recovering or down state.
> What could be the possible reason and/or fix for the issue.
>
> Gopal

Reply via email to