Hi
I have a set of solr cloud nodes which were working correctly for a
while until one of them had problems (due to incorrect java version -
but the cause of how it went bad is not very important)
Once the problem was fixed and the node was started, I got the
following situation: The node continually tries to recover and fails
because it has not recovered.
I double checked that cluster state in zookeeper is correct and also
that remaining nodes see the cloud correctly (with this one node in not
started state).
Is there a good way to recover from here with least risk of losing data.
There is also a trace below from solr log.
Thanks
Vivek
This is for solr 4.6
ERROR - 2014-06-04 17:45:53.890; org.apache.solr.cloud.RecoveryStrategy;
Recovery failed - trying again... (2) core=weekly_20140331_replica1
INFO - 2014-06-04 17:45:53.891; org.apache.solr.cloud.RecoveryStrategy;
Wait 8.0 seconds before trying to recover again (3)
INFO - 2014-06-04 17:46:01.892; org.apache.solr.cloud.ZkController;
publishing core=weekly_20140331_replica1 state=recovering
ERROR - 2014-06-04 17:48:01.930; org.apache.solr.common.SolrException;
Error while trying to recover.
core=weekly_20140331_replica1:org.apache.solr.client.solrj.impl.HttpSolrServer$RemoteSolrException:
I was asked to wait on state recovering for n09:8983_solr but I still do
not see the requested state. I see state: down live:true
at
org.apache.solr.client.solrj.impl.HttpSolrServer.request(HttpSolrServer.java:491)
at
org.apache.solr.client.solrj.impl.HttpSolrServer.request(HttpSolrServer.java:197)
at
org.apache.solr.cloud.RecoveryStrategy.sendPrepRecoveryCmd(RecoveryStrategy.java:221)
at
org.apache.solr.cloud.RecoveryStrategy.doRecovery(RecoveryStrategy.java:367)
at
org.apache.solr.cloud.RecoveryStrategy.run(RecoveryStrategy.java:244)
ERROR - 2014-06-04 17:48:01.930; org.apache.solr.cloud.RecoveryStrategy;
Recovery failed - trying again... (3) core=weekly_20140331_replica1
INFO - 2014-06-04 17:48:01.930; org.apache.solr.cloud.RecoveryStrategy;
Wait 16.0 seconds before trying to recover again (4)
INFO - 2014-06-04 17:48:17.932; org.apache.solr.cloud.ZkController;
publishing core=weekly_20140331_replica1 state=recovering
ERROR - 2014-06-04 17:50:17.969; org.apache.solr.common.SolrException;
Error while trying to recover.
core=weekly_20140331_replica1:org.apache.solr.client.solrj.impl.HttpSolrServer$RemoteSolrException:
I was asked to wait on state recovering for n09:8983_solr but I still do
not see the requested state. I see state: down live:true
at
org.apache.solr.client.solrj.impl.HttpSolrServer.request(HttpSolrServer.java:491)
at
org.apache.solr.client.solrj.impl.HttpSolrServer.request(HttpSolrServer.java:197)
at
org.apache.solr.cloud.RecoveryStrategy.sendPrepRecoveryCmd(RecoveryStrategy.java:221)
at
org.apache.solr.cloud.RecoveryStrategy.doRecovery(RecoveryStrategy.java:367)
at
org.apache.solr.cloud.RecoveryStrategy.run(RecoveryStrategy.java:244)
ERROR - 2014-06-04 17:50:17.969; org.apache.solr.cloud.RecoveryStrategy;
Recovery failed - trying again... (4) core=weekly_20140331_replica1