[slurm-users] Re: Crash in "slurmd -C" when latest NVIDIA drivers are used

2025-05-22 Thread Taras Shapovalov via slurm-users
pute-570-server nvidia-cuda-toolkit from https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2404/x86_64/ That ends up grabbing the latest cuda bits which support the newer drivers. Brian Andrus On 5/19/2025 12:50 PM, Taras Shapovalov via slurm-users wrote: Hello, Does someone have i

[slurm-users] Crash in "slurmd -C" when latest NVIDIA drivers are used

2025-05-19 Thread Taras Shapovalov via slurm-users
Hello, Does someone have idea why "slurmd -C" crashes when it unloads gpu_nrt.so with latest NVIDIA drivers (570 and 575)? We checked, there is no crash in cuda at the moment and gpu_nvml.so works fine, all nvml calls finish successfully, dlclose on gpu_nvml.so works fine. The crash does not de

[slurm-users] IMEX plugin in Slurm 24.05

2024-06-19 Thread Taras Shapovalov via slurm-users
Hello, Does anyone know if there is any documentation about the NVIDIA IMEX plugin for Slurm 24.05? It is not even in man page for slurm.conf, though it is in the release notes. Best regards, Taras -- slurm-users mailing list -- slurm-users@lists.schedmd.com To unsubscribe send an email to s