Re: [slurm-users] Use gres to handle permissions of /dev/dri/card* and /dev/dri/renderD*?

Martin Pecka Thu, 06 Jan 2022 09:23:01 -0800

Hello, I'm reviving an old thread: I just noticed that my January 2021 message is missing from the archives, so I'm sending it again now that the issue has come up again on our side.

To quickly recap, we want to grant permissions not only to the /dev/nvidia* devices based on the requested gres, but also to the corresponding /dev/dri/card* and /dev/dri/renderD* devices. They all belong to the same GPU, but the additional two allow using the card for rendering instead of CUDA computations etc. I had an idea how to achieve that without changing the SLURM codebase, and I got something that could almost work. It probably just needs some polishing. Could anybody please comment on whether the proposed solution is a good idea?



The 15 Jan 2021 message:

So I started wondering whether this could be handled by a prolog script and direct cgroup manipulation. I'm no expert in either, so please check my line of thought.

#!/bin/bash
PATH=/usr/bin/:/bin

gpus=${SLURM_STEP_GPUS:-$SLURM_JOB_GPUS}  # or CUDA_VISIBLE_DEVICES when run inside the cgroup?
cgroup=$(grep devices /proc/self/cgroup | cut -d: -f3)  # or something else?

# blacklist all DRM devices (char major 226); type "c" is used here because
# type "a" matches every device regardless of the major number given
# (cgset takes the plain cgroup path; the controller follows from the parameter name)
cgset -r devices.deny="c 226:* rwm" "${cgroup}"

for NVIDIA_SMI_ID in ${gpus//,/ }; do
  # find on which PCI path this device sits (strip 4 of the 8 domain digits, lower-case)
  pci_id=$(nvidia-smi -i "$NVIDIA_SMI_ID" --query-gpu=pci.bus_id --format=csv,noheader | tail -c+5 | tr '[:upper:]' '[:lower:]')

  # find the DRM devices sitting on the same PCI device
  card=$(ls "/sys/bus/pci/devices/${pci_id}/drm/" 2>/dev/null | grep -m1 '^card')
  render=$(ls "/sys/bus/pci/devices/${pci_id}/drm/" 2>/dev/null | grep -m1 '^renderD')

  # allow access to the DRM devices
  [ -n "${card}" ] && cgset -r devices.allow="c $(cat "/sys/class/drm/${card}/dev") rw" "${cgroup}" && echo "Allowed /dev/dri/${card} DRI device access"
  [ -n "${render}" ] && cgset -r devices.allow="c $(cat "/sys/class/drm/${render}/dev") rw" "${cgroup}" && echo "Allowed /dev/dri/${render} render node access"
done
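
The PCI lookup in the script above can be checked without a GPU. Below is a minimal sketch of that step only, written as two helper functions. The names normalize_pci_id and drm_nodes_for_pci_id, as well as the optional sysfs root argument, are made up for illustration and are not part of SLURM or nvidia-smi. It assumes nvidia-smi reports the bus ID with an 8-digit domain and upper-case hex, while sysfs uses a 4-digit domain and lower case.

```shell
# Convert an nvidia-smi style bus ID (e.g. 00000000:3B:00.0) into the form
# used under /sys/bus/pci/devices (e.g. 0000:3b:00.0).
# Hypothetical helper, not part of any existing tool.
normalize_pci_id() (
    id=$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')
    case $id in
        # 8-digit domain: drop the first four characters
        ????????:??:??.?) id=${id#????} ;;
    esac
    printf '%s\n' "$id"
)

# Print the names of the DRM nodes (card*, renderD*) that belong to one PCI
# device, one per line. The second argument is an assumed knob that lets the
# function be pointed at a fake sysfs tree for testing; it defaults to /sys.
drm_nodes_for_pci_id() (
    pci_id=$1
    sysfs_root=${2:-/sys}
    dir=$sysfs_root/bus/pci/devices/$pci_id/drm
    for node in "$dir"/card* "$dir"/renderD*; do
        # an unmatched glob stays literal, so skip anything that does not exist
        if [ -e "$node" ]; then
            basename "$node"
        fi
    done
)
```

In the loop of the script, this would be used roughly as drm_nodes_for_pci_id "$(normalize_pci_id "$raw_bus_id")", where raw_bus_id holds the nvidia-smi output for one GPU.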

Now I wonder whether this should be Prolog=, TaskProlog= or something else (that would also change whether I look at CUDA_VISIBLE_DEVICES or SLURM_STEP_GPUS, and how I figure out the cgroup name). I guess that if this script were run as the invoking user, nothing would prevent him from regaining access to all devices. So I'd be inclined to treat it as a Prolog= script run by root. How would I get the cgroup ID then? Compose it from parts as mentioned in the slurm cgroups docs (/cgroup/cpuset/slurm/uid_100/job_123/step_0/task_2)? Or is there a more reliable way?
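
To make the "compose it from parts" option concrete, here is a minimal sketch. It assumes cgroup v1, a devices hierarchy mounted at /sys/fs/cgroup/devices, and the slurm/uid_*/job_*/step_* layout from the path quoted above. Whether that cgroup already exists at the time Prolog= runs is also an assumption that would need checking. The function names slurm_devices_cgroup and allow_device are made up for illustration.

```shell
# Compose the job (or step) cgroup path relative to the devices hierarchy.
# Args: uid, job id, optional step id. Hypothetical helper.
slurm_devices_cgroup() (
    path="/slurm/uid_$1/job_$2"
    if [ -n "${3:-}" ]; then
        path="$path/step_$3"
    fi
    printf '%s\n' "$path"
)

# Write one whitelist entry (for example "c 226:128 rw") to devices.allow of
# the given cgroup. The third argument is the assumed cgroup v1 mount point
# of the devices controller; it is a parameter so the function can be tried
# against a scratch directory.
allow_device() (
    entry=$1
    cgroup=$2
    mount_root=${3:-/sys/fs/cgroup/devices}
    printf '%s\n' "$entry" > "$mount_root$cgroup/devices.allow"
)
```

In a root-run Prolog= script this could be called as allow_device "c 226:128 rw" "$(slurm_devices_cgroup "$SLURM_JOB_UID" "$SLURM_JOB_ID")", with the major:minor pair read from /sys/class/drm/<node>/dev as in the script above. Writing to devices.allow directly avoids depending on the cgset tool being installed on the compute nodes.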

A related but off-topic idea popped into my head while thinking about GPUs. Most of them are actually a consolidation of several devices such as stream processors, encoders, decoders, ray tracers, shaders, memory etc. Could it be possible (in the future) to offer each of these pieces as a different gres? The problem is that most of them do not have any special file which the user could lock to tell the others he's working there now. So it would probably require support at the level of the cgroup implementation, which, in turn, would require changing all GPU drivers. And it would require being able to request just chunks of GPU memory (not sure if that's possible right now, but I think I saw some pull request about that).



Thank you for any hints!


Martin


On 21.10.2020 at 19:09, Martin Pecka wrote:

Or maybe this could be "emulated" by a set of 3 GRES per card that are "linked" together? I.e. rules like "if the user requests GRES /dev/dri/card0, he will also automatically need to claim /dev/dri/renderD128 and /dev/nvidia0"?
On 21.10.2020 at 18:52, Daniel Letai wrote:
Take a look at https://github.com/SchedMD/slurm/search?q=dri%2F
If the ROCM-SMI API is present, using AutoDetect=rsmi in gres.conf might be enough, if I'm reading this right.
Of course, this assumes the cards in question are AMD and not NVIDIA.


On 20/10/2020 23:58, Mgr. Martin Pecka wrote:
Pinging this topic again. Does nobody have an idea how to define multiple files to be treated as a single gres?
Thank you for help,

Martin Pecka

On 4.9.2020 at 21:29, Martin Pecka wrote:
Hello, we want to use the EGL backend for accessing OpenGL without the need for Xorg. This approach requires access to the devices /dev/dri/card* and /dev/dri/renderD*. Is there a way to give access to these devices along with /dev/nvidia*, which we use for CUDA? Ideally as a single generic resource that would give permissions to all three files at once.
Thank you for any tips.

Re: [slurm-users] Use gres to handle permissions of /dev/dri/card* and /dev/dri/renderD*?
