Thank you very much!
Those were the missing settings!
I am not sure how I overlooked it for nearly two days, but I am happy
that its working now.
Cheers
Dominik Baack
Am 27.10.2022 um 19:23 schrieb Sean Maxwell:
It looks like you are missing some of the slurm.conf entries related
to
2 um 17:57 schrieb Sean Maxwell:
Hi Dominik,
Do you have ConstrainDevices=yes set in your cgroup.conf?
Best,
-Sean
On Thu, Oct 27, 2022 at 11:49 AM Dominik Baack
wrote:
Hi,
We are in the process of setting up SLURM on some DGX A100 nodes . We
are experiencing the problem that al
the correct id only discarded by the rest
of the system.
Cheers
Dominik Baack
Example:
baack@gwkilab:~$ srun --gpus=1 nvidia-smi
Thu Oct 27 17:39:04 2022
+-+
| NVIDIA-SMI 470.141.03 Driver Version: 470.141.03 CUDA