You can also check out
HealthCheckNodeState=CYCLE
man slurm.conf:
"Rather than running the health check program on all nodes at the same
time, cycle through running on all compute nodes through the course of
the HealthCheckInterval. May be combined with the various node state
options."
--
Chee
Call healthcheck with a shell script that starts with:
sleep $[ ( $RANDOM % 10 ) + 1 ], or similar.
M.K.
From: slurm-users on behalf of SJTU
Sent: Thursday, November 26, 2020 8:24 PM
To: slurm-users@lists.schedmd.com
Subject: [slurm-users] Set a ramdom offset