I have inherited a 20-node cluster that supposedly has an infiniband
network. I am testing some mpi applications and am seeing no
performance improvement with multiple nodes. So I am wondering if the
Infiband network even works?
The output of ifconfig -a shows an ib0 and ib1 network. I ran ethtools
ib0 and it shows:
Speed: 40000Mb/s
Link detected: no
and for ib1 it show:
Speed: 10000Mb/s
Link detected: no
I am assuming this means it is down? Any idea how to debug further and
restart it?
Thanks!
_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing
To change your subscription (digest mode or unsubscribe) visit
http://www.beowulf.org/mailman/listinfo/beowulf