On Fri, Jun 28, 2013 at 09:45:50AM +0100, Jonathan Barber wrote:

> The problem with SSH based approaches is when you have failed nodes -
> normally they cause the entire command to hang until the attempted
> connection times out.

Normally what people do is ping the node before trying ssh on it. And
have reasonable timeouts around both the ssh connect and the command
execution. There's no fundamental reason why this is any different
from messaging or subscription-plus-messaging.

-- greg

_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf

Reply via email to