Re: [slurm-users] A strange situation of different network cards on the same network

2023-10-10 Thread Ryan Novosielski
We have, and have had it come and go with no clear explanation. I’d watch out for MTU and netmask troubles, sysctl limits that might be relevant (apparently the default settings for time spent doing ethernet are really appropriate for <1 Gb, not so much faster), hot spots on the network, etc. -

[slurm-users] A strange situation of different network cards on the same network

2023-10-10 Thread James Lam
We have a cluster of 176 nodes consisting Infiniband switch and 10GbE and we are using 10GbE as SSH. Currently we have the older cards of Marvell 10GbE at launch https://support.hpe.com/connect/s/softwaredetails?language=en_US&softwareId=MTX_117b0672d7ef4c5bb0eca02886