Re: [slurm-users] Set a ramdom offset when starting node health check in SLURM

2020-11-27 Thread Bjørn-Helge Mevik
You can also check out HealthCheckNodeState=CYCLE. From man slurm.conf: "Rather than running the health check program on all nodes at the same time, cycle through running on all compute nodes through the course of the HealthCheckInterval. May be combined with the various node state options." -- Chee
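For illustration, a slurm.conf fragment using this option might look roughly like the following. The program path and the interval are assumptions made up for this sketch, not values from the thread; only the parameter names and the CYCLE and ANY values come from the slurm.conf man page.

```
# Hypothetical path to the health check program (an assumption for this sketch)
HealthCheckProgram=/usr/sbin/nhc
# Interval in seconds between health check runs (300 is an arbitrary example value)
HealthCheckInterval=300
# Check nodes in any state, and spread the runs across the interval
# instead of starting them on all nodes at the same time
HealthCheckNodeState=ANY,CYCLE
```

With CYCLE, the spreading is done by Slurm itself, so no random sleep is needed inside the health check script.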

Re: [slurm-users] Set a ramdom offset when starting node health check in SLURM

2020-11-26 Thread Micheal Krombopulous
Call the health check through a shell script that starts with: sleep $(( (RANDOM % 10) + 1 )), or similar. M.K.

From: slurm-users on behalf of SJTU
Sent: Thursday, November 26, 2020 8:24 PM
To: slurm-users@lists.schedmd.com
Subject: [slurm-users] Set a ramdom offset
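The random-sleep suggestion above could be sketched as a small bash wrapper like the one below. The names REAL_CHECK, MAX_DELAY and random_delay, and the path /usr/sbin/nhc, are hypothetical and chosen only for this example; point REAL_CHECK at whatever health check program the site actually uses.

```shell
#!/bin/bash
# Hypothetical wrapper: wait a random number of seconds, then run the
# real health check. All names and paths here are assumptions.

REAL_CHECK="${REAL_CHECK:-/usr/sbin/nhc}"   # assumed location of the real check
MAX_DELAY="${MAX_DELAY:-10}"                # upper bound of the delay, in seconds

# Print a random integer between 1 and the given maximum (inclusive).
random_delay() {
    local max="${1:-10}"
    echo $(( (RANDOM % max) + 1 ))
}

# Delay and hand over to the real check only if it exists and is
# executable; otherwise this sketch does nothing.
if [ -x "$REAL_CHECK" ]; then
    sleep "$(random_delay "$MAX_DELAY")"
    exec "$REAL_CHECK"
fi
```

HealthCheckProgram in slurm.conf would then point at this wrapper instead of the real program. It is probably wise to keep MAX_DELAY small, since Slurm puts a time limit on the health check program (60 seconds, if the man page is remembered correctly).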