Re: [slurm-users] 2 nodes being randomly set to "not responding"

2021-07-22 Thread Russell Jones
; > source: https://slurm.schedmd.com/elastic_computing.html > > Cheers > > Josef > > Sent from Nine <http://www.9folders.com/> > > -- > *From:* Russell Jones > *Sent:* Wednesday, 21 July 2021 22:30 > *To:* Slurm User Community List > *Subject

Re: [slurm-users] 2 nodes being randomly set to "not responding"

2021-07-21 Thread jose
sell Jones Sent: Wednesday, 21 July 2021 22:30 To: Slurm User Community List Subject: [slurm-users] 2 nodes being randomly set to "not responding" Hi all, We have a single slurm cluster with multiple different architectures and compute clusters talking to a single slurmctld. This slurmc

[slurm-users] 2 nodes being randomly set to "not responding"

2021-07-21 Thread Russell Jones
Hi all, We have a single slurm cluster with multiple different architectures and compute clusters talking to a single slurmctld. This slurmctld is dual-homed on two different networks. We have two individual nodes who are by themselves on "network 2" while all of the other nodes are on "network 1"