if a single node goes down, you need to take down all the
nodes in the chassis before you can remove the dead node. Not very
practical.

Eh? What's so hard about marking the other nodes as unusable in your
batch system, and waiting for them to become free?

depends on your max job length.  but yeah, idling three nodes for a week
is not going to be noticeable in anything but a quite small cluster...
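
To put rough numbers on that, here is a minimal sketch of the share of
capacity that sits idle while the chassis drains. The function name
idle_fraction and the cluster sizes are assumptions for illustration
only; the thread gives just the "three nodes" figure.

```python
def idle_fraction(idle_nodes: int, total_nodes: int) -> float:
    """Share of the cluster's nodes left idle while the chassis drains."""
    if total_nodes <= 0:
        raise ValueError("total_nodes must be positive")
    if not 0 <= idle_nodes <= total_nodes:
        raise ValueError("idle_nodes must be between 0 and total_nodes")
    return idle_nodes / total_nodes


if __name__ == "__main__":
    # Three healthy nodes share the chassis with the dead one (from the thread).
    # The cluster sizes below are hypothetical examples.
    for total in (16, 64, 256, 1024):
        share = idle_fraction(3, total)
        print(f"{total:5d} nodes: {share:.2%} of capacity idle")
```

The share only applies for as long as the drain takes, which is bounded
by the max job length mentioned above.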
_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf
