Re: [slurm-users] Can't find an address

2018-10-27 Thread Zohar Roe MLM
: [slurm-users] Can't find an address In addition to these other suggestions, keep in mind the slurmd's will talk to each other if you have more then 50 nodes(see TreeWidth in slurm.conf), so this will require the nodes to be able to DNS lookup and communicate to all the other nodes

Re: [slurm-users] Can't find an address

2018-10-25 Thread Eli V
same name that Slurm > expects on your compute nodes. > > > From: Zohar Roe Mlm > Sent: Thursday, October 25, 2018 3:02AM > To: 'Slurm User Community List' > Cc: > Subject: Re: [slurm-users] Can't find an address > > H

Re: [slurm-users] Can't find an address

2018-10-25 Thread Andy Riebs
*Cc:* *Subject:* Re: [slurm-users] Can't find an address Hi Lachlan, Thanks for the replay. I am trying to find more Ideas for this problem. May be some system or strange communication problem. As for your suggestion: Check that it's in /etc/hosts --> It is. And answer to ping

Re: [slurm-users] Can't find an address

2018-10-25 Thread Zohar Roe MLM
the server can't find it (And it happen every two minute, always). Thanks for your ideas, Roy. From: slurm-users [mailto:slurm-users-boun...@lists.schedmd.com] On Behalf Of Lachlan Musicman Sent: Thursday, October 25, 2018 1:59 AM To: Slurm User Community List Subject: Re: [slurm-use

Re: [slurm-users] Can't find an address

2018-10-24 Thread Lachlan Musicman
On Wed, 24 Oct 2018 at 22:56, Zohar Roe MLM wrote: > Hello, > > I have a node that from some reason change state to "Down" evert few > minutes. > > When I change it with scontrol to "resume" its ok until Down again. > > In the slurm server log I can see error: > > "agent/is_node_resp: node:myName

[slurm-users] Can't find an address

2018-10-24 Thread Zohar Roe MLM
Hello, I have a node that from some reason change state to "Down" evert few minutes. When I change it with scontrol to "resume" its ok until Down again. In the slurm server log I can see error: "agent/is_node_resp: node:myName1 RPC:REQUEST_PING : Can't find an address, check slurm.conf" Now, The