Hi Li, If you could supply some more info from your logs would help. We also had some similar issue. There were some bugs related to SolrCloud that were solved on solr 4.10.4 and further on solr 5.x. I would suggest you compare your logs with defects on 4.10.4 release notes to see if they are the same. Also, send relevant solr/zookeeper parts of logs to the mailing list.
On Thu, Apr 21, 2016 at 1:50 AM, Li Ding <li.d...@bloomreach.com> wrote: > Hi All, > > We are using SolrCloud 4.6.1. We have observed following behaviors > recently. A Solr node in a Solrcloud cluster is up but some of the cores > on the nodes are marked as down in Zookeeper. If the cores are parts of a > multi-sharded collection with one replica, the queries to that collection > will fail. However, when this happened, if we issue queries to the core > directly, it returns 200 and correct info. But once Solr got into the > state, the core will be marked down forever unless we do a restart on Solr. > > Has anyone seen this behavior before? Is there any to get out of the state > on its own? > > Thanks, > > Li >