Hi Ryan,
Thank you very much for your reply. That is useful. We'll see how we get on.
Best regards,
David
From: slurm-users on behalf of Ryan Novosielski
Sent: 11 September 2020 00:08
To: Slurm User Community List
Subject: Re: [slurm-users] Slurm -- usin
I’m fairly sure that you set this up the same way you would for a peer-to-peer
setup. Here’s ours:
[root@cuda001 ~]# nvidia-smi topo --matrix
        GPU0    GPU1    GPU2    GPU3    mlx4_0  CPU Affinity
GPU0     X      PIX     SYS     SYS     PHB     0-11
GPU1    PIX      X      SYS     SYS
Hello,
We are installing a group of nodes which all contain 4 GPU cards. The GPUs are
paired together using NVLINK as described in the matrix below.
We are familiar with using Slurm to schedule and run jobs on GPU cards, but
this is the first time we have dealt with NVLINK-enabled GPUs. Could s