Re: [slurm-users] gpu count

2019-06-27 Thread Valerio Bellizzomi
On Thu, 2019-06-27 at 15:50 +0200, Marcus Boden wrote: > Hi, > > this is usually due to a misconfiguration in your gres.conf (at least it > was for me). Can you show your gres.conf? I have revised the configuration, it needed File=... parameter. > Best, > Marcus > > On 19-06-27 15:33, Valer

Re: [slurm-users] gpu count

2019-06-27 Thread Marcus Boden
Hi, this is usually due to a misconfiguration in your gres.conf (at least it was for me). Can you show your gres.conf? Best, Marcus On 19-06-27 15:33, Valerio Bellizzomi wrote: > hello, my node has 2 gpus so I have specified gres=gpus:2 but the > scontrol show node displays this: > > State=IDLE

Re: [slurm-users] gpu count

2019-06-27 Thread Eli V
gres has to be specified in both slurm.conf and gres.conf and gres.conf must be present on the node with the gres. I keep a single cluster wide gres.conf and copy it to all nodes just like slurm.conf. Also, after adding a new gres I think both the slurmctld and the slurmd needs to be restarted. On

Re: [slurm-users] gpu count

2019-06-27 Thread Valerio Bellizzomi
On Thu, 2019-06-27 at 15:33 +0200, Valerio Bellizzomi wrote: > hello, my node has 2 gpus so I have specified gres=gpus:2 but the > scontrol show node displays this: > > State=IDLE+DRAIN > Reason=gres/gpus count too low (1 < 2) Also, the node is repeating a debug message: debug2: got this type of

[slurm-users] gpu count

2019-06-27 Thread Valerio Bellizzomi
hello, my node has 2 gpus so I have specified gres=gpus:2 but the scontrol show node displays this: State=IDLE+DRAIN Reason=gres/gpus count too low (1 < 2)