On 3/10/20 9:03 AM, sysadmin.caos wrote:
my SLURM cluster has configured a partition with a "TimeLimit" of 8 hours. Now, a job is running during 9h30m and it has been not cancelled. During these 9 hours and a half, a script has executed a "scontrol update partition=mypartition state=down" for disabling this partition (educational cluster and at 8:00 start students classes).

Why my job hasn't been cancelled? There is no any log at SLURM controller that explains this behaviour.

You may want to check the following parameter in your slurm.conf file (read the man-page first):

AccountingStorageEnforce: This controls what level of association-based enforcement to impose on job submissions.

You may want to read about EnforcePartLimits and OverTimeLimit parameters as well.

Display your current configuration by: scontrol show config

/Ole



Reply via email to