Re: [slurm-users] Effect of slurmctld and slurmdb going down on running/pending jobs

Barbara Krašovec Wed, 23 Jun 2021 22:29:59 -0700

Just in case, increase Slurmdtimeout in slurm.conf (so that when thecontroller is back, it will give you time to fix the issues with thecommunication between slurmd and slurmctld - if there will be any).Otherwise it should not affect running and pending jobs. First stopcontroller, then slurmdbd. And then when the disk arrangements are done,first start slurmdbd and then slurmctld.


Cheers,


Barbara

On 6/24/21 12:54 AM, Amjad Syed wrote:

Hello all
We have a cluster running centos 7 . Our slurm scheduler isrunning on a vm machine and we are running out of disk space for /var The slurm innodb is taking most of space. We intend to expand thevdisk for slurm server. This will require a reboot for changes totake effect. Do we have to stop users submitting jobs by drainingall partitions and then restart the server. That is slurmctld.slurmdband mariadb? Or will the restarting of slurm vm have no effect onrunning/pending iobs?
Sincerely

Amjad

Re: [slurm-users] Effect of slurmctld and slurmdb going down on running/pending jobs

Reply via email to