Thanks for the info and link to your bug report. Unfortunately, my
GraceTime is already set to zero for that QOS:
$ sacctmgr show qos interruptible format=Name,gracetime
      Name  GraceTime
---------- ----------
interrupt+   00:00:00
On 2/26/21 3:58 PM, Michael Robbert wrote:
We saw something that sounds similar to this. See this bug report:
https://bugs.schedmd.com/show_bug.cgi?id=10196
SchedMD never found the root cause. They thought it might have something to do
with a timing problem on Prolog scripts, but the thing that fixed it for us was
to set GraceTime=0 on