I made gres.conf identical on both nodes, and Slurm now starts without error.
However, I'm now seeing a different problem.

There are 4 GPUs defined per node. If I submit 2 jobs with
#SBATCH --gpus=foolsgold:4
one job runs on each of the 2 nodes. If I scancel those and submit 4 jobs with
the script reading
#SBATCH --gpus=foolsgold:1
I get 2 running and 2 queued jobs. It seems that requesting 1 GPU allocates all
4 on a node, not just 1. Why would that be?
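
A minimal sketch of a test script for the single-GPU case, assuming a bash
batch script. The job name, the report_allocation helper, and the 60-second
hold are illustrative assumptions, not taken from the original script. It
prints what Slurm exports to the job and then holds the allocation long enough
to inspect the queue:

```shell
#!/bin/bash
#SBATCH --job-name=gres-test
#SBATCH --gpus=foolsgold:1

# Print the job ID, GPU IDs, and CPU count that Slurm exported to this job.
# SLURM_JOB_GPUS may be unset for a test gres with no real devices behind it.
report_allocation() {
    echo "job=${SLURM_JOB_ID:-unset} gpus=${SLURM_JOB_GPUS:-unset} cpus=${SLURM_CPUS_ON_NODE:-unset}"
}

report_allocation

# Hold the allocation only when running under Slurm, so that the
# running/queued split can be observed with squeue.
if [ -n "${SLURM_JOB_ID:-}" ]; then
    sleep 60
fi
```

Submitting this four times with sbatch and then running squeue shows, in the
NODELIST(REASON) column, why the pending jobs are waiting. That may help tell
whether the GPUs or some other resource (CPUs, memory) is the limiting factor.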

Thanks,
Erik
________________________________
From: slurm-users <slurm-users-boun...@lists.schedmd.com> on behalf of Chris 
Samuel <ch...@csamuel.org>
Sent: Thursday, December 24, 2020 5:44 PM
To: slurm-users@lists.schedmd.com <slurm-users@lists.schedmd.com>
Subject: Re: [slurm-users] trying to add gres

On 24/12/20 4:42 pm, Erik Bryer wrote:

> I made sure my slurm.conf is synchronized across machines. My intention
> is to add some arbitrary gres for testing purposes.

Did you update your gres.conf on all the nodes to match?

All the best,
Chris
--
Chris Samuel  :  http://www.csamuel.org/  :  Berkeley, CA, USA
