[
https://issues.apache.org/jira/browse/HADOOP-11000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320818#comment-14320818
]
Chris Nauroth commented on HADOOP-11000:
----------------------------------------
+1 from me too. [~vinayrpet], I see you already +1'd a few months ago but did
not commit. Was there a reason that you were holding off on the commit? I'm
also happy to do the commit if you prefer. Just let me know. Thanks!
> HAServiceProtocol's health state is incorrectly transitioned to
> SERVICE_NOT_RESPONDING
> --------------------------------------------------------------------------------------
>
> Key: HADOOP-11000
> URL: https://issues.apache.org/jira/browse/HADOOP-11000
> Project: Hadoop Common
> Issue Type: Bug
> Reporter: Ming Ma
> Assignee: Ming Ma
> Attachments: HADOOP-11000-2.patch, HADOOP-11000.patch
>
>
> When HAServiceProtocol.monitorHealth throws a HealthCheckFailedException, the
> actual exception from protocol buffer RPC is a RemoteException that wraps the
> real exception. Thus the state is incorrectly transitioned to
> SERVICE_NOT_RESPONDING
> {noformat}
> HealthMonitor.java
> doHealthChecks
> try {
> status = proxy.getServiceStatus();
> proxy.monitorHealth();
> healthy = true;
> } catch (HealthCheckFailedException e) {
> .....
> enterState(State.SERVICE_UNHEALTHY);
> } catch (Throwable t) {
> .....
> enterState(State.SERVICE_NOT_RESPONDING);
> .....
> }
> {noformat}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)