Hi all,
I want to test strigger's fail event, but I don't know how to put a node
into the 'FAILING' state.
man strigger:
-F, --fail
Trigger an event if the specified node goes into a FAILING state.
How do I set the status of a node to FAILING?
[root@ohpc137pbsop-sms ~]# cat /usr/sbin/slurm_admin_notify
#!/bin/bash
# Submit trigger for next event
strigger --set --node --fail --program=/usr/sbin/slurm_admin_notify
echo "[event trigger] slurm admin notify fail event trigger run" >> /tmp/slurmctld.log
[root@ohpc137pbsop-sms ~]#
[root@ohpc137pbsop-sms ~]# chmod +x /usr/sbin/slurm_admin_notify
[root@ohpc137pbsop-sms ~]# strigger --get
TRIG_ID RES_TYPE RES_ID TYPE OFFSET USER FLAGS PROGRAM
18      node     *      fail 0      root       /usr/sbin/slurm_admin_notify
[root@ohpc137pbsop-sms ~]#
In short, I need a way to change a compute node's state to FAILING so that
this trigger fires.
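My best guess from the scontrol man page is that State=FAIL on a node
update is the way to request this, but I have not confirmed that it fires
the trigger. As far as I understand, the node would only show FAILING while
a job is still running on it. The sketch below just prints the commands I
would try; the node name c1 and the reason string are placeholders of my
own, not taken from my cluster.

```shell
#!/bin/bash
# Sketch only: c1 is a hypothetical node name, and State=FAIL is my
# reading of the scontrol man page, untested against the fail trigger.
NODE="${NODE:-c1}"
REASON="strigger_fail_test"

# Build the command that asks slurmctld to mark the node as failing.
fail_cmd() {
    echo "scontrol update NodeName=$1 State=FAIL Reason=$2"
}

# Build the command that returns the node to service afterwards.
resume_cmd() {
    echo "scontrol update NodeName=$1 State=RESUME"
}

# Print the commands for review instead of running them directly.
fail_cmd "$NODE" "$REASON"
resume_cmd "$NODE"
```

If that is the right approach, I would run the first printed command as
root with a job active on the node, check /tmp/slurmctld.log for the echo
line, and then run the second command.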
Regards,
Tomo