Re: [slurm-users] After reboot nodes are in state = down

2019-09-27 Thread Rafał Kędziorski
o.k. thx for the explanation. Am Fr., 27. Sept. 2019 um 15:38 Uhr schrieb Steffen Grunewald < steffen.grunew...@aei.mpg.de>: > On Fri, 2019-09-27 at 14:58:40 +0200, Rafał Kędziorski wrote: > > Am Fr., 27. Sept. 2019 um 13:50 Uhr schrieb Steffen Grunewald < > > steffen.grunew...@aei.mpg.de>: > > >

Re: [slurm-users] After reboot nodes are in state = down

2019-09-27 Thread Steffen Grunewald
On Fri, 2019-09-27 at 14:58:40 +0200, Rafał Kędziorski wrote: > Am Fr., 27. Sept. 2019 um 13:50 Uhr schrieb Steffen Grunewald < > steffen.grunew...@aei.mpg.de>: > > On Fri, 2019-09-27 at 11:19:16 +0200, Juergen Salk wrote: > > > > > > you may try setting `ReturnToService=2´ in slurm.conf. > > > > >

Re: [slurm-users] After reboot nodes are in state = down

2019-09-27 Thread Juergen Salk
* Rafał Kędziorski [190927 14:58]: > > > > > > you may try setting `ReturnToService=2´ in slurm.conf. > > > > > > > Caveat: A spontaneously rebooting machine may create a "black hole" this > > way. > > > > How do you mean this? Could ReturnToService=2 be a problem? > Hi Rafał, black hole syndr

Re: [slurm-users] After reboot nodes are in state = down

2019-09-27 Thread Rafał Kędziorski
Am Fr., 27. Sept. 2019 um 13:50 Uhr schrieb Steffen Grunewald < steffen.grunew...@aei.mpg.de>: > On Fri, 2019-09-27 at 11:19:16 +0200, Juergen Salk wrote: > > Hi Rafał, > > > > you may try setting `ReturnToService=2´ in slurm.conf. > > > > Best regards > > Jürgen > > Caveat: A spontaneously reboot

Re: [slurm-users] After reboot nodes are in state = down

2019-09-27 Thread Steffen Grunewald
On Fri, 2019-09-27 at 11:19:16 +0200, Juergen Salk wrote: > Hi Rafał, > > you may try setting `ReturnToService=2´ in slurm.conf. > > Best regards > Jürgen Caveat: A spontaneously rebooting machine may create a "black hole" this way. - Steffen -- Steffen Grunewald, Cluster Administrator Max P

Re: [slurm-users] After reboot nodes are in state = down

2019-09-27 Thread Juergen Salk
Hi Rafał, you may try setting `ReturnToService=2´ in slurm.conf. Best regards Jürgen -- Jürgen Salk Scientific Software & Compute Services (SSCS) Kommunikations- und Informationszentrum (kiz) Universität Ulm Telefon: +49 (0)731 50-22478 Telefax: +49 (0)731 50-22471 * Rafał Kędziorski [190927

Re: [slurm-users] After reboot nodes are in state = down

2019-09-27 Thread Rafał Kędziorski
Hi Andreas, my Cluster is not running whole time. I call just sudo shutdown. And after boot the nodes are in state down. I'm using Slurn on Raspi Cluster (5* Pi 4). What is the best way to shutdown the nodes that after boot the nodes are idle and not down? Regards, Rafal Am Fr., 27. Sept. 2019

Re: [slurm-users] After reboot nodes are in state = down

2019-09-26 Thread Henkel, Andreas
Hi Rafal, How do you restart the nodes? If you don’t use scontrol reboot Slurm doesn’t expect nodes to reboot therefore you see that reason in those cases. Best Andreas Am 27.09.2019 um 07:53 schrieb Rafał Kędziorski mailto:rafal.kedzior...@gmail.com>>: Hi, I'm working with slurm-wlm 18.08.

[slurm-users] After reboot nodes are in state = down

2019-09-26 Thread Rafał Kędziorski
Hi, I'm working with slurm-wlm 18.08.5-2 on Raspberry Pi Cluster: - 1 Pi 4 as manager - 4 Pi 4 nodes This work fine. But after every restart of the nodes I get this cluster@pi-manager:~ $ sinfo PARTITION AVAIL TIMELIMIT NODES STATE NODELIST devcluster*up infinite 4 down pi-4-n