[slurm-users] Re: slurm releases

2025-04-05 Thread Ryan Novosielski via slurm-users
There are multiple supported releases, and at present you can upgrade directly from any of the last 3 releases, which come out every 6 months. Major releases are more disruptive, so the previous versions stay supported to provide continuity. https://slurm.schedmd.com/upgrades.html

[slurm-users] cpus and gpus partitions and how to optimize the resource usage

2025-04-05 Thread Massimo Sgaravatto via slurm-users
Dear all, We have just installed a small Slurm cluster composed of 12 nodes: - 6 CPU-only nodes: Sockets=2, CoresPerSocket=96, ThreadsPerCore=2, 1.5 TB of RAM - 6 nodes that also have GPUs: same configuration as the CPU-only nodes + 4 H100 per node. We started with a setup with 2 partitions: - a 'onlycpu

[slurm-users] Re: Best Way to See GPUs in Use?

2025-04-05 Thread Paul Edmon via slurm-users
If you run scontrol -d show node, it will show in more detail which resources are actually in use: [root@holy8a24507 general]# scontrol show node holygpu8a11101 NodeName=holygpu8a11101 Arch=x86_64 CoresPerSocket=48    CPUAlloc=70 CPUEfctv=96 CPUTot=96 CPULoad=173.07 AvailableFeatures=amd,holyn
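
As a rough illustration of the suggestion above, the sketch below pulls GPU totals and GPUs in use out of the text printed by scontrol -d show node. It assumes the output contains Gres= and GresUsed= fields in the usual gpu[:type]:count form. The sample text, the node name node01 and the helper gpu_counts are hypothetical and not taken from the original message, and real GRES strings depend on the site's configuration.

```python
import re

# Hypothetical sample of the relevant fields only; real output from
# `scontrol -d show node <name>` has many more fields, and the exact
# GRES strings depend on the site's gres.conf.
SAMPLE = """NodeName=node01 Arch=x86_64 CoresPerSocket=48
   CPUAlloc=70 CPUEfctv=96 CPUTot=96 CPULoad=173.07
   Gres=gpu:h100:4
   GresUsed=gpu:h100:3(IDX:0-2)
"""

# Matches "gpu:<count>" or "gpu:<type>:<count>" and captures the count.
GPU_ENTRY = re.compile(r"gpu(?::[^:,()\s]+)?:(\d+)")


def gpu_counts(text):
    """Return (total, used) GPU counts parsed from scontrol node output."""

    def count(field):
        # Find e.g. "GresUsed=<value>" and sum every gpu count inside the value.
        match = re.search(rf"\b{field}=(\S+)", text)
        if match is None:
            return 0
        return sum(int(n) for n in GPU_ENTRY.findall(match.group(1)))

    return count("Gres"), count("GresUsed")


if __name__ == "__main__":
    total, used = gpu_counts(SAMPLE)
    print(f"GPUs in use: {used} of {total}")
```

On a real cluster the text could come from something like subprocess.run(["scontrol", "-d", "show", "node", name], capture_output=True, text=True).stdout instead of the sample string.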

[slurm-users] slurm job submission using different UID/GID

2025-04-05 Thread navin srivastava via slurm-users
Hi, Does munge depend on the UID/GID for job submission? In our scenario the cluster runs with the NIS authentication mechanism. A login node was created that is part of LDAP, and we installed Slurm on it to interact with the Slurm master. I am able to run jobs as root but