I made the gres.conf the same on both nodes and Slurm started without error. I'm now seeing another error.
There are 4 GPUs defined per node. If I start 2 jobs with #SBATCH --gpus=foolsgold:4 it runs one job in each of the 2 nodes. If I scancel those and run 4 jobs with the script reading #SBATCH --gpus=foolsgold:1 I get 2 queued and 2 running jobs. It seems allocating 1 gpu allocates all 4, not just 1. But why would this be so? Thanks, Erik ________________________________ From: slurm-users <slurm-users-boun...@lists.schedmd.com> on behalf of Chris Samuel <ch...@csamuel.org> Sent: Thursday, December 24, 2020 5:44 PM To: slurm-users@lists.schedmd.com <slurm-users@lists.schedmd.com> Subject: Re: [slurm-users] trying to add gres On 24/12/20 4:42 pm, Erik Bryer wrote: > I made sure my slurm.conf is synchronized across machines. My intention > is to add some arbitrary gres for testing purposes. Did you update your gres.conf on all the nodes to match? All the best, Chris -- Chris Samuel : http://www.csamuel.org/ : Berkeley, CA, USA