[slurm-users] Can't find an address
In addition to these other suggestions, keep in mind that the slurmd daemons will
talk to each other if you have more than 50 nodes (see TreeWidth in slurm.conf),
so every node needs to be able to look up all the other nodes in DNS and
communicate with them.
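
A quick way to test the point above is to run a small name-resolution check on each compute node, not only on the controller. The following is a minimal sketch in Python, not something from this thread: the function name and the node names in the example list are placeholders, and it only tests that a name resolves through the node's normal resolver (/etc/hosts or DNS), not that slurmd is reachable on its port.

```python
import socket


def unresolved_hosts(hostnames):
    """Return {hostname: error text} for every name that fails to resolve."""
    failures = {}
    for name in hostnames:
        try:
            # Uses the system resolver, so /etc/hosts and DNS are both consulted.
            socket.getaddrinfo(name, None)
        except socket.gaierror as exc:
            failures[name] = str(exc)
    return failures


if __name__ == "__main__":
    # Placeholder node names: replace with the node names from your slurm.conf.
    nodes = ["myName1", "myName2"]
    for host, err in unresolved_hosts(nodes).items():
        print(f"{host}: {err}")
```

If this prints nothing on the controller but reports failures on some compute nodes, the name lookup problem is on those nodes.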
same name that Slurm
> expects on your compute nodes.
>
>
> From: Zohar Roe Mlm
> Sent: Thursday, October 25, 2018 3:02AM
> To: 'Slurm User Community List'
> Cc:
> Subject: Re: [slurm-users] Can't find an address
>
> H
*Cc:*
*Subject:* Re: [slurm-users] Can't find an address
Hi Lachlan,
Thanks for the reply. I am trying to find more ideas for this problem.
Maybe it is some system problem or a strange communication problem.
As for your suggestion:
Check that it's in /etc/hosts --> It is. And it answers to ping
the server
can't find it (and it happens every two minutes, always).
Thanks for your ideas,
Roy.
From: slurm-users [mailto:slurm-users-boun...@lists.schedmd.com] On Behalf Of
Lachlan Musicman
Sent: Thursday, October 25, 2018 1:59 AM
To: Slurm User Community List
Subject: Re: [slurm-use
On Wed, 24 Oct 2018 at 22:56, Zohar Roe MLM wrote:
> Hello,
>
> I have a node that for some reason changes state to "Down" every few
> minutes.
>
> When I change it with scontrol to "resume" it's OK until it goes Down again.
>
> In the slurm server log I can see error:
>
> "agent/is_node_resp: node:myName
Hello,
I have a node that for some reason changes state to "Down" every few minutes.
When I change it with scontrol to "resume" it's OK until it goes Down again.
In the slurm server log I can see error:
"agent/is_node_resp: node:myName1 RPC:REQUEST_PING : Can't find an address,
check slurm.conf"
Now, The