Re: [slurm-users] slurm.conf syntax checker?

2021-10-13 Thread Paul Edmon
Sadly no.  There is a feature request for one though: https://bugs.schedmd.com/show_bug.cgi?id=3435 What we've done in the meantime is put together a gitlab runner which basically starts up a mini instance of the scheduler and runs slurmctld on the slurm.conf we want to put in place.  We then

Re: [slurm-users] job is pending but resources are available

2021-10-13 Thread Adam Xu
在 10/13/21 16:30, Ole Holm Nielsen 写道: On 10/13/21 9:59 AM, Adam Xu wrote: 在 2021/10/13 9:22, Brian Andrus 写道: Something is very odd when you have the node reporting: RealMemory=1 AllocMem=0 FreeMem=47563 Sockets=2 Boards=1 What do you get when you run ‘slurmd -C’ on the node? # slurmd

Re: [slurm-users] job is pending but resources are available

2021-10-13 Thread Ole Holm Nielsen
On 10/13/21 9:59 AM, Adam Xu wrote: 在 2021/10/13 9:22, Brian Andrus 写道: Something is very odd when you have the node reporting: RealMemory=1 AllocMem=0 FreeMem=47563 Sockets=2 Boards=1 What do you get when you run ‘slurmd -C’ on the node? # slurmd -C NodeName=apollo CPUs=36 Boards=1 Socket

Re: [slurm-users] job is pending but resources are available

2021-10-13 Thread Adam Xu
在 2021/10/13 9:22, Brian Andrus 写道: Something is very odd when you have the node reporting: RealMemory=1 AllocMem=0 FreeMem=47563 Sockets=2 Boards=1 What do you get when you run ‘slurmd -C’ on the node? # slurmd -C NodeName=apollo CPUs=36 Boards=1 SocketsPerBoard=2 CoresPerSocket=18 Thread