[slurm-users] Slurm versions 20.02.1 and 19.05.6 are now available

2020-03-26 Thread Tim Wickberg
We are pleased to announce the availability of Slurm versions 20.02.1 and 19.05.6. This includes a series of minor fixes since the last maintenance releases for both branches. Please note that the 19.05.6 release is expected to be the the last maintenance release of that branch (barring any

Re: [slurm-users] Slurm - Maridb error

2020-03-26 Thread Dhumal, Dr. Nilesh
Ole, Thanks for the link. I am able to run rpmbuild -ta slurm-20.02.0.tar.bz2 successfully. My computing node doesn't have internet access. Any recommendation, how to install all packages offline. I also tried to install slurm-19.05.5.tar.bz2. It doesn't get install successfully. Is it there a

[slurm-users] DefMemPerGPU bug?

2020-03-26 Thread Wayne Hendricks
When using 20.02/cons_tres and defining DefMemPerGPU, jobs submitted that request GPUs without defining “—mem” will not run more than one job per node. I can see where it is allocating the correct amount of memory for the job per GPUs requested, but no other jobs will run on the node. If a value

Re: [slurm-users] Is it safe to convert cons_res to cons_tres on a running system?

2020-03-26 Thread Steven Dick
When I changed this on a running system, no jobs were killed, but slurm lost track of jobs on nodes and was unable to kill them or tell when they were finished until slurmd on each node was restarted. I let running jobs complete and monitored them manually, and restarted slurmd on each node as the

Re: [slurm-users] Is it safe to convert cons_res to cons_tres on a running system?

2020-03-26 Thread Pär Lindfors
Hi Nate, On Fri, 2020-02-21 at 11:38 -0800, Nathan R Crawford wrote: > If it just requires restarting slurmctld and the slurmd processes > on the nodes, I will be happy! Can you confirm that no running or > pending jobs were lost in the transition? Did you change your SelectType to cons_tres? H