Thanks Diego.

Actually, there is nothing at all in the hosts file; I did not seem to need to modify it to see the nodes. The different case on one of the node names was an experiment to see whether the names are in fact case-sensitive.

All other networking functions between the nodes, munge for example, seem to work. It is only slurmctld that is not talking to the nodes, even though chatter between them can be seen in the log with a higher log level set.

Steve Bland
Technical Product Manager, Third Party Products
Ross Video | Production Technology Experts
T: +1 (613) 228-0688 ext.4219
www.rossvideo.com

________________________________
From: Diego Zuccato <diego.zucc...@unibo.it>
Sent: 30 November 2020 02:20
To: Slurm User Community List <slurm-users@lists.schedmd.com>; Steve Bland <sbl...@rossvideo.com>
Subject: Re: [slurm-users] [EXTERNAL] Re: trying to diagnose a connectivity issue between the slurmctld process and the slurmd nodes

On 27/11/20 17:18, Steve Bland wrote:

> NodeName=SRVGRIDSLURM01 NodeAddr=192.168.1.60 CPUs=4 Boards=1
> SocketsPerBoard=1 CoresPerSocket=4 ThreadsPerCore=1 RealMemory=7821
> NodeName=SRVGRIDSLURM02 NodeAddr=192.168.1.61 CPUs=4 Boards=1
> SocketsPerBoard=1 CoresPerSocket=4 ThreadsPerCore=1 RealMemory=7821
> NodeName=srvgridslurm03 NodeAddr=192.168.1.62 CPUs=4 Boards=1
> SocketsPerBoard=1 CoresPerSocket=4 ThreadsPerCore=1 RealMemory=7821

The only issue I see here is that Slurm is case-sensitive. Maybe you have case-different names for the nodes in your /etc/hosts? Just guessing, though.

--
Diego Zuccato
DIFA - Dip. di Fisica e Astronomia
Servizi Informatici
Alma Mater Studiorum - Università di Bologna
V.le Berti-Pichat 6/2 - 40127 Bologna - Italy
tel.: +39 051 20 95786
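
If node names are compared case-sensitively, as Diego suggests, one quick check is to line up the NodeName values from slurm.conf against the short hostname each node reports. The following is only a rough sketch of that comparison: the `reported` list is hypothetical (it stands in for whatever `hostname -s` prints on each node, which the thread does not show), the config text is trimmed to the fields that matter here, and the parser does not handle hostlist expressions such as `node[01-03]`.

```python
import re

# Node definitions as quoted in the thread, trimmed to the relevant fields.
SLURM_CONF = """\
NodeName=SRVGRIDSLURM01 NodeAddr=192.168.1.60 CPUs=4
NodeName=SRVGRIDSLURM02 NodeAddr=192.168.1.61 CPUs=4
NodeName=srvgridslurm03 NodeAddr=192.168.1.62 CPUs=4
"""


def node_names(conf_text):
    """Return the NodeName values found in slurm.conf-style text."""
    return re.findall(r"^NodeName=(\S+)", conf_text, flags=re.MULTILINE)


def case_mismatches(conf_names, reported_names):
    """Return (conf_name, reported_name) pairs that differ only by case."""
    by_lower = {name.lower(): name for name in reported_names}
    mismatches = []
    for name in conf_names:
        other = by_lower.get(name.lower())
        if other is not None and other != name:
            mismatches.append((name, other))
    return mismatches


# Hypothetical values: what `hostname -s` might print on each node.
reported = ["srvgridslurm01", "srvgridslurm02", "srvgridslurm03"]

for conf_name, host_name in case_mismatches(node_names(SLURM_CONF), reported):
    print(f"{conf_name} (slurm.conf) vs {host_name} (hostname)")
```

Any pair it prints is a name that matches only when case is ignored, which would be the first thing to rule out before looking further at the slurmctld logs.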