[ 
https://issues.apache.org/jira/browse/HADOOP-11000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320818#comment-14320818
 ] 

Chris Nauroth commented on HADOOP-11000:
----------------------------------------

+1 from me too.  [~vinayrpet], I see you already +1'd a few months ago but did 
not commit.  Was there a reason that you were holding off on the commit?  I'm 
also happy to do the commit if you prefer.  Just let me know.  Thanks!

> HAServiceProtocol's health state is incorrectly transitioned to 
> SERVICE_NOT_RESPONDING
> --------------------------------------------------------------------------------------
>
>                 Key: HADOOP-11000
>                 URL: https://issues.apache.org/jira/browse/HADOOP-11000
>             Project: Hadoop Common
>          Issue Type: Bug
>            Reporter: Ming Ma
>            Assignee: Ming Ma
>         Attachments: HADOOP-11000-2.patch, HADOOP-11000.patch
>
>
> When HAServiceProtocol.monitorHealth throws a HealthCheckFailedException, the 
> actual exception from protocol buffer RPC is a RemoteException that wraps the 
> real exception. Thus the state is incorrectly transitioned to 
> SERVICE_NOT_RESPONDING
> {noformat}
> HealthMonitor.java
> doHealthChecks
>       try {
>         status = proxy.getServiceStatus();
>         proxy.monitorHealth();
>         healthy = true;
>       } catch (HealthCheckFailedException e) {
>         .....
>         enterState(State.SERVICE_UNHEALTHY);
>       } catch (Throwable t) {
>         .....
>         enterState(State.SERVICE_NOT_RESPONDING);
>         .....
>       }
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to