Hi Erik, Erik Bryer <ebr...@isi.edu> writes:
> Thanks for your reply. I can't find NVML in the logs going back to > 11/22. dmesg goes back to the last boot, but has no mention of > NVML. Regarding make one up on my own, how does Slurm know string > "xyzzy" corresponds to a tesla gpu, e.g.? As I understand it, Slurm doesn't need to know the correspondence, since all it is doing is counting. If you define a GRES, say, magic:wand you can configure your nodes to have, say, 2 of these. Then if a job requests --gres=magic:wand:1 and starts, a subsequent job which requests --gres=magic:wand:2 will have to wait until the first magic wand become free again. However, Slurm doesn't need to know whether your nodes really do have magic wands, but your users do need to request them, if their jobs require them. To prevent them using a magic wand without requesting one, you have to check the job parameters on submission, which you can do via the job submit plugin. Regards Loris > Thanks, > Erik > ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- > From: slurm-users <slurm-users-boun...@lists.schedmd.com> on behalf of > Michael Di Domenico <mdidomeni...@gmail.com> > Sent: Tuesday, December 15, 2020 1:24 PM > To: Slurm User Community List <slurm-users@lists.schedmd.com> > Subject: Re: [slurm-users] gres names > > you can either make them up on your own or they get spit out by NVML > in the slurmd.log file > > On Tue, Dec 15, 2020 at 12:55 PM Erik Bryer <ebr...@isi.edu> wrote: >> >> Hi, >> >> Where do I get the gres names, e.g. "rtx2080ti", to use for my gpus in my >> node definitions in slurm.conf? >> >> Thanks, >> Erik > -- Dr. Loris Bennett (Hr./Mr.) ZEDAT, Freie Universität Berlin Email loris.benn...@fu-berlin.de