Re: [slurm-users] Apparent scontrol reboot bug

2019-01-22 Thread Bas van der Vlies
Thanks for the update. We are going to try to build a new package and test it. On 22/01/2019 15:30, Douglas Jacobsen wrote: There were several related commits last week: https://github.com/SchedMD/slurm/commits/slurm-18.08 On Tue, Jan 22, 2019 at 06:28 Douglas Jacobsen > w

Re: [slurm-users] Apparent scontrol reboot bug

2019-01-22 Thread Douglas Jacobsen
There were several related commits last week: https://github.com/SchedMD/slurm/commits/slurm-18.08 On Tue, Jan 22, 2019 at 06:28 Douglas Jacobsen wrote: > Hello, > > Yes, it's a bug in the way the reboot RPCs are handled. A fix was recently > committed which we have yet to test, but 18.08.5 is

Re: [slurm-users] Apparent scontrol reboot bug

2019-01-22 Thread Douglas Jacobsen
Hello, Yes, it's a bug in the way the reboot RPCs are handled. A fix was recently committed which we have yet to test, but 18.08.5 is meant to repair this (among other things). Doug On Tue, Jan 22, 2019 at 02:46 Martijn Kruiten wrote: > Hi, > > We encounter a strange issue on our system (Slurm

[slurm-users] Apparent scontrol reboot bug

2019-01-22 Thread Martijn Kruiten
Hi, We are encountering a strange issue on our system (Slurm 18.08.3), and I'm curious whether any of you recognize this behavior. In the following example we try to reboot 32 nodes, of which 31 are idle:
root# scontrol reboot ASAP nextstate=resume reason=image r8n[1-32]
root# sinfo -o "%100