Re: [slurm-users] srun problem -- Can't find an address, check slurm.conf

2018-11-13 Thread Scott Hazelhurst
Dear Mercan Thank you! — yes different paths so different behaviour. Amazing how you can spend so much time looking at something and not seeing it. On Sunday did an upgrade from 17.11.10 to 17.11.12 to try to fix the problem but had left old binaries in a directory I should not have, so kept

Re: [slurm-users] srun problem -- Can't find an address, check slurm.conf

2018-11-13 Thread Scott Hazelhurst
Dear all I still haven’t found the cause to the problem I raised last week where srun -w xx runs for some nodes but not for others — thanks for the ideas. One intriguing result I’ve had trying to pursue this which I thought I’d share in case it sparks some ideas. If I give the full path for s

Re: [slurm-users] srun problem -- Can't find an address, check slurm.conf

2018-11-07 Thread Scott Hazelhurst
Thanks, Paul, yes, it does seem a likely cause, but I can’t see the problem. All machines have the same /etc/hosts file and the worker nodes are just listed one after each other. I’ve checked that the problem nodes are there — no obvious difference. I’ve checked that the IP address is correct.

[slurm-users] srun problem -- Can't find an address, check slurm.conf

2018-11-07 Thread Scott Hazelhurst
Dear list We have a relatively new installation of SLURM. We have started to have a problem with some of the nodes when using srun [scott@cream-ce ~]$ srun --pty -w n38 hostname srun: error: fwd_tree_thread: can't find address for host n38, check slurm.conf srun: error: Task launch for 18710.0