I think it is very likely that it is due to Solr-nodes losing
ZK-connections (after timeout). We have experienced that a lot. One
thing you want to do, is to make sure your ZK-servers does not run on
the same machines as your Solr-nodes - that helped us a lot.
On 24/03/15 13:57, Gopal Jee wrot
What do the Solr logs show happens on those servers when they go into
recovery? What have you tried to do to diagnose the problem? You might
review: http://wiki.apache.org/solr/UsingMailingLists
The first thing I'd check, though, is whether you're seeing large GC
pauses that exceed the Zookeeper t