vivekratnavel opened a new pull request #1424: HDDS-2107. Datanodes should 
retry forever to connect to SCM in an…
URL: https://github.com/apache/hadoop/pull/1424
 
 
   … unsecure environment
   
    In an unsecure environment, the datanodes try upto 10 times after waiting 
for 1000 milliseconds each time before throwing this error:
   
   ```Unable to communicate to SCM server at scm:9861 for past 0 seconds.
   java.net.ConnectException: Call From scm:9861 failed on connection 
exception: java.net.ConnectException: Connection refused;```
   
   This PR fixes that issue by having datanodes try forever to connect with SCM 
and not fail immediately after 10 retries.
   
   I have also increased timeouts on a unit test to improve its stability.
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to