Hi
I'm running a cluster in a cloud provider and have run up against an odd
problem with power save. I've got several hundred nodes that Slurm won't
power up even though they appear idle and in the powered-down state. I
suspect that they are in a "not-so-idle" state: `scontrol` for all of the
no
ronan, you would test open ports using 'nmap'
On 17 July 2018 at 13:17, Buckley, Ronan wrote:
> Disabling the firewall service on the centos client allows the ‘srun
> hostname’ command to run.
>
>
>
> *From:* Buckley, Ronan
> *Sent:* Tuesday, July 17, 2018 12:00 PM
> *To:* 'Slurm User Community
Disabling the firewall service on the centos client allows the ‘srun hostname’
command to run.
From: Buckley, Ronan
Sent: Tuesday, July 17, 2018 12:00 PM
To: 'Slurm User Community List'
Subject: RE: [slurm-users] 'srun hostname' hangs on the command line
Hi Carlos, Is there a way to test that? A
Hi Carlos, Is there a way to test that? Are there certain ports that need to be
open? Thanks.
From: slurm-users [mailto:slurm-users-boun...@lists.schedmd.com] On Behalf Of
Carlos Fenoy
Sent: Tuesday, July 17, 2018 11:55 AM
To: Slurm User Community List
Subject: Re: [slurm-users] 'srun hostname'
The communication from the compute nodes to the login nodes may be block by the
firewall. That will prevent srun from running properly
Sent from my iPhone
> On 17 Jul 2018, at 10:16, John Hearns wrote:
>
> Ronan, as far as I can see this means that you cannot launch a job.
>
> What state are
Ronan, as far as I can see this means that you cannot launch a job.
What state are the compute nodes in when you run sinfo?
On 17 July 2018 at 10:08, Buckley, Ronan wrote:
> Yes, srun just hangs. Commands like sinfo and squeue run fine.
>
> I also have no slurm logs in /var/log ??
>
>
>
> *Fro
Yes, srun just hangs. Commands like sinfo and squeue run fine.
I also have no slurm logs in /var/log ??
From: slurm-users [mailto:slurm-users-boun...@lists.schedmd.com] On Behalf Of
John Hearns
Sent: Tuesday, July 17, 2018 8:57 AM
To: Slurm User Community List
Subject: Re: [slurm-users] 'srun hos
Ronan, sorry to ask but this is a bit unclear.
Are you unable to launch ANY sessions with srun?
In which case you need to look at the logs to see why the job is not being
scheduled.
Is it only the hostname command which fails?
I would guess very much you have already run an ssh into a node and r
Yes I do.
From: slurm-users [mailto:slurm-users-boun...@lists.schedmd.com] On Behalf Of
Williams, Gareth (IM&T, Clayton)
Sent: Tuesday, July 17, 2018 12:33 AM
To: Slurm User Community List
Subject: Re: [slurm-users] 'srun hostname' hangs on the command line
Do you get the same problem as a non-r