Re: [slurm-users] Slurm -- using GPU cards with NVLINK

2020-09-11 Thread David Baker
Hi Ryan, Thank you very much for your reply. That is useful. We'll see how we get on. Best regards, David From: slurm-users on behalf of Ryan Novosielski Sent: 11 September 2020 00:08 To: Slurm User Community List Subject: Re: [slurm-users] Slurm -- usin

Re: [slurm-users] Slurm -- using GPU cards with NVLINK

2020-09-10 Thread Ryan Novosielski
I’m fairly sure that you set this up the same way you set up for a peer-to-peer setup. Here’s ours: [root@cuda001 ~]# nvidia-smi topo --matrix GPU0GPU1GPU2GPU3mlx4_0 CPU Affinity GPU0 X PIX SYS SYS PHB 0-11 GPU1PIX X SYS SYS

[slurm-users] Slurm -- using GPU cards with NVLINK

2020-09-10 Thread David Baker
Hello, We are installing a group of nodes which all contain 4 GPU cards. The GPUs are paired together using NVLINK as described in the matrix below. We are familiar with using Slurm to schedule and run jobs on GPU cards, but this is the first time we have dealt with NVLINK enabled GPUs. Could s