What version of Solr?

On Tue, Jan 20, 2015 at 7:07 AM, anand.mahajan <an...@zerebral.co.in> wrote:
> Hi all,
>
>
> I have a cluster with 36 shards and 3 replicas per shard. I recently had to
> restart the entire cluster. Most of the shards and replicas are back up, but
> a few shards went without a leader for a long time (close to 18 hours). I
> tried reloading these cores and even restarting the servlet containers
> hosting them. Only now do all the shards have leaders allocated, but a few
> of these leaders are still shown with Recovery Failed status in the
> Solr Cloud tree view.
>
>
> I see the following in the logs for these shards -
> INFO  - 2015-01-20 14:38:19.797;
> org.apache.solr.handler.admin.CoreAdminHandler; In WaitForState(recovering):
> collection=collection1, shard=shard1, thisCore=collection1_shard1_replica3,
> leaderDoesNotNeedRecovery=false, isLeader? true, live=true, checkLive=true,
> currentState=recovering, localState=recovery_failed,
> nodeName=10.68.77.9:8983_solr, coreNodeName=core_node2,
> onlyIfActiveCheckResult=true, nodeProps:
> core_node2:{"state":"recovering","core":"collection1_shard1_replica1","node_name":"10.68.77.9:8983_solr","base_url":"http://10.68.77.9:8983/solr"}
>
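In case it helps while comparing nodes: a rough Python sketch for pulling the
key=value fields out of a WaitForState message like the one quoted above, so
currentState and localState can be compared across cores. The function names
and SAMPLE_MESSAGE are my own (the sample is copied from the quoted log), and
the "leader not active" check is only an illustration, not something Solr
provides.

```python
import re

# Copied from the quoted log message; line wrapping comes from the mail client.
SAMPLE_MESSAGE = """In WaitForState(recovering):
collection=collection1, shard=shard1, thisCore=collection1_shard1_replica3,
leaderDoesNotNeedRecovery=false, isLeader? true, live=true, checkLive=true,
currentState=recovering, localState=recovery_failed,
nodeName=10.68.77.9:8983_solr, coreNodeName=core_node2,
onlyIfActiveCheckResult=true, nodeProps:"""


def parse_wait_for_state(message):
    """Return the key=value fields of a WaitForState log message as a dict."""
    # Re-join wrapped lines and drop any leading '>' quote markers.
    flat = " ".join(line.lstrip("> ").strip() for line in message.splitlines())
    fields = dict(re.findall(r"(\w+)=([^,\s]+)", flat))
    # "isLeader? true" does not use '=', so it is picked up separately.
    match = re.search(r"isLeader\?\s*(\w+)", flat)
    if match:
        fields["isLeader"] = match.group(1)
    return fields


def leader_not_active(fields):
    """Illustrative check: the core is the leader but its local state is not active."""
    return fields.get("isLeader") == "true" and fields.get("localState") != "active"


if __name__ == "__main__":
    parsed = parse_wait_for_state(SAMPLE_MESSAGE)
    print(parsed["thisCore"], parsed["currentState"], parsed["localState"])
    print("leader not active:", leader_not_active(parsed))
```

Running it over the message from each affected core gives a quick list of
which leaders report a local state other than active.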
>
> And on the other server hosting the replica for this shard -
> ERROR - 2015-01-20 14:38:20.768; org.apache.solr.common.SolrException;
> org.apache.solr.common.SolrException: I was asked to wait on state
> recovering for shard3 in collection1 on 10.68.77.9:8983_solr but I still do
> not see the requested state. I see state: recovering live:true leader from
> ZK: http://10.68.77.3:8983/solr/collection1_shard3_replica3/
>         at
> org.apache.solr.handler.admin.CoreAdminHandler.handleWaitForStateAction(CoreAdminHandler.java:999)
>         at
> org.apache.solr.handler.admin.CoreAdminHandler.handleRequestInternal(CoreAdminHandler.java:245)
>         at
> org.apache.solr.handler.admin.CoreAdminHandler.handleRequestBody(CoreAdminHandler.java:188)
>         at
> org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:135)
>         at
> org.apache.solr.servlet.SolrDispatchFilter.handleAdminRequest(SolrDispatchFilter.java:729)
>         at
> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:258)
>         at
> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:207)
>         at
> org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1419)
>         at
> org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:455)
>         at
> org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:137)
>         at
> org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:557)
>         at
> org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:231)
>         at
> org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1075)
>         at
> org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:384)
>         at
> org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:193)
>         at
> org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1009)
>         at
> org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:135)
>         at
> org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:255)
>         at
> org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:154)
>         at
> org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:116)
>         at org.eclipse.jetty.server.Server.handle(Server.java:368)
>         at
> org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(AbstractHttpConnection.java:489)
>         at
> org.eclipse.jetty.server.BlockingHttpConnection.handleRequest(BlockingHttpConnection.java:53)
>         at
> org.eclipse.jetty.server.AbstractHttpConnection.headerComplete(AbstractHttpConnection.java:942)
>         at
> org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.headerComplete(AbstractHttpConnection.java:1004)
>         at org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:640)
>         at 
> org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:235)
>         at
> org.eclipse.jetty.server.BlockingHttpConnection.handle(BlockingHttpConnection.java:72)
>         at
> org.eclipse.jetty.server.bio.SocketConnector$ConnectorEndPoint.run(SocketConnector.java:264)
>         at
> org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:608)
>         at
> org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:543)
>         at java.lang.Thread.run(Unknown Source)
>
>
> I see that there is no replica catch-up going on between any of these
> servers now.
> A couple of questions -
> 1. What is Solr Cloud waiting on before it allocates leaders for
> such shards?
> 2. Why do a few of these shards show leaders in the Recovery Failed state? And
> how do I recover such shards?
>
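To see exactly which shards are affected, something like the sketch below
could be run against a copy of the cluster state. It assumes the Solr 4.x
clusterstate.json layout (collection -> "shards" -> shard -> "replicas", with
"leader":"true" on the leader replica); that layout may differ in your
version, which is part of why I am asking. SAMPLE_STATE is made-up data for
illustration, and shards_needing_attention is a hypothetical helper name.

```python
import json

# Made-up cluster state for illustration only; not taken from your cluster.
SAMPLE_STATE = """
{"collection1": {"shards": {
  "shard1": {"replicas": {
    "core_node1": {"state": "recovery_failed", "leader": "true"},
    "core_node2": {"state": "recovering"}}},
  "shard2": {"replicas": {
    "core_node3": {"state": "down"},
    "core_node4": {"state": "recovering"}}},
  "shard3": {"replicas": {
    "core_node5": {"state": "active", "leader": "true"},
    "core_node6": {"state": "active"}}}
}}}
"""


def shards_needing_attention(cluster_state):
    """List (collection, shard, reason) for shards with no leader or a non-active leader."""
    problems = []
    for coll_name, coll in cluster_state.items():
        for shard_name, shard in coll.get("shards", {}).items():
            replicas = shard.get("replicas", {}).values()
            leaders = [r for r in replicas if r.get("leader") == "true"]
            if not leaders:
                problems.append((coll_name, shard_name, "no leader"))
            for leader in leaders:
                state = leader.get("state", "unknown")
                if state != "active":
                    problems.append((coll_name, shard_name, "leader state: " + state))
    return problems


if __name__ == "__main__":
    for problem in shards_needing_attention(json.loads(SAMPLE_STATE)):
        print(problem)
```

This only reports what the cluster state says; it does not attempt any
recovery.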
> Thanks,
> Anand
>
>
>
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/Leaders-in-Recovery-Failed-state-tp4180611.html
> Sent from the Solr - User mailing list archive at Nabble.com.
