Hi,
we have two DGX A100 systems that we would like to use with Slurm. We
want to use the MIG feature for _some_ of the GPUs. As I had somewhat
suspected, I couldn't find a working setup for this in Slurm yet. I'll
describe the configuration variants I tried after creating the MIG
instances; it might be a longer read, so please bear with me.
1. using slurm-mig-discovery for gres.conf
(https://gitlab.com/nvidia/hpc/slurm-mig-discovery)
- CUDA_VISIBLE_DEVICES: list of indices
-> at first glance this seems to give a working setup with full
flexibility, but on closer inspection the selection of GPU devices is
completely unpredictable (checked via the output of nvidia-smi inside a
Slurm job); a sketch of the generated config follows below
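Roughly, the gres.conf generated by the tool contains one typed line per
MIG compute instance pointing at the corresponding /dev/nvidia-caps
device (plus a matching cgroup allowed-devices file); the type names and
cap numbers below are only placeholders, not our actual values:

  # gres.conf as generated by slurm-mig-discovery (placeholders)
  Name=gpu Type=1g.5gb  File=/dev/nvidia-caps/nvidia-cap66
  Name=gpu Type=1g.5gb  File=/dev/nvidia-caps/nvidia-cap75
  Name=gpu Type=2g.10gb File=/dev/nvidia-caps/nvidia-cap84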
2. using "AutoDetect=nvml" in gres.conf (Slurm docs)
- CUDA_VISIBLE_DEVICES: MIG format (see
https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#env-vars)
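For reference, in the MIG format CUDA_VISIBLE_DEVICES holds entries like
the following (UUIDs abbreviated); the first form uses the parent GPU
UUID plus GPU instance and compute instance IDs, and newer drivers also
accept the MIG device UUID directly:

  CUDA_VISIBLE_DEVICES=MIG-GPU-<GPU-UUID>/<GPU instance ID>/<compute instance ID>
  CUDA_VISIBLE_DEVICES=MIG-<MIG-UUID>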
2.1 converting ALL GPUs to MIG
- even a full A100 is converted to a single 7g.40gb MIG instance
- gres.conf: "AutoDetect=nvml" only
- slurm.conf Node Def: naming all MIG types (read from slurmd debug log)
-> working setup
-> problem: IPC (and therefore MPI) between MIG instances is not
possible; this seems to be a by-design limitation
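For illustration, the whole 2.1 setup then boils down to this (node
name, type names and counts are placeholders for one possible
partitioning of the 8 GPUs per node; the real type names were taken from
the slurmd debug log):

  # gres.conf
  AutoDetect=nvml

  # slurm.conf
  NodeName=dgx01 ... Gres=gpu:7g.40gb:4,gpu:1g.5gb:28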
2.2 converting SOME GPUs to MIG
- some A100s are NOT in MIG mode
2.2.1 using "AutoDetect=nvml" only (Variant 1)
- slurm.conf Node Def: Gres entries with and without a type
-> problem: fatal: _foreach_slurm_conf: Some gpu GRES in slurm.conf have
a type while others do not (slurm_gres->gres_cnt_config (26) > tmp_count
(21))
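The node definition that triggers this fatal error had the following
shape; node name and counts are placeholders chosen to match the log
output above (21 typed MIG instances plus 5 untyped full GPUs = 26):

  # slurm.conf
  NodeName=dgx01 ... Gres=gpu:1g.5gb:21,gpu:5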
2.2.2 using "AutoDetect=nvml" only (Variant 2)
- slurm.conf Node Def: only Gres without a type (sum of MIG + non-MIG devices)
-> problem: the different GPU types can't be requested individually (see below)
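That is, everything on the node collapses into one untyped count (node
name and count below are placeholders); a job can then only request
--gres=gpu:N and cannot distinguish a 1g.5gb slice from a full A100:

  # slurm.conf
  NodeName=dgx01 ... Gres=gpu:26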
2.2.3 using partial "AutoDetect=nvml"
- gres.conf: "AutoDetect=nvml" + hardcoding of non MIG GPUs
- slurm.conf Node Def: MIG + non MIG Gres types
-> produces a "perfect" config according to slurmd debug log
-> problem: apparently the presence of explicit entries puts
"AutoDetect=nvml" into a sanity-check mode that only validates the file
instead of adding the detected MIG devices, so the node is drained (?):
-> Reason=gres/gpu:1g.5gb count too low (0 < 21) [slurm@2022-01-27T11:23:59]
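For illustration, this variant means a gres.conf/slurm.conf pair roughly
like the following; the type name "a100", the node name and the device
paths are my own choice, not something Slurm mandates:

  # gres.conf
  AutoDetect=nvml
  # non-MIG GPUs hardcoded:
  Name=gpu Type=a100 File=/dev/nvidia3
  Name=gpu Type=a100 File=/dev/nvidia4
  Name=gpu Type=a100 File=/dev/nvidia5
  Name=gpu Type=a100 File=/dev/nvidia6
  Name=gpu Type=a100 File=/dev/nvidia7

  # slurm.conf
  NodeName=dgx01 ... Gres=gpu:1g.5gb:21,gpu:a100:5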
2.2.4 using static gres.conf with NVML generated config
- a static gres.conf based on the NVML-generated config, in which I can
define the type for the non-MIG GPUs and also set the UniqueId for the
MIG instances, would be the perfect solution (see the sketch below)
- slurm.conf Node Def: MIG + non-MIG Gres types
-> problem: it doesn't work, the config is rejected:
-> Parsing error at unrecognized key: UniqueId
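For completeness, this is the kind of gres.conf I have in mind,
essentially a transcript of what the NVML autodetection reports, with
the UniqueId values shortened and type/paths as placeholders:

  # gres.conf (rejected because of the UniqueId key)
  Name=gpu Type=a100   File=/dev/nvidia3
  Name=gpu Type=1g.5gb File=/dev/nvidia0 UniqueId=MIG-<MIG-UUID>
  Name=gpu Type=1g.5gb File=/dev/nvidia0 UniqueId=MIG-<MIG-UUID>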
Thanks for reading this far. Am I missing something? How can MIG and
non-MIG devices be addressed in the same cluster? A setup with both MIG
and non-MIG devices can't be that exotic, since having ONLY MIG devices
has severe disadvantages (see 2.1). Thanks again for any advice.
Matthias