Hi Riccardo.

I've had a similar problem (slurm.conf is served via NFS share). I just modified slurmd unit:
#systemctl edit slurmd
[Unit]
Requires=network-online.target
After=home.mount

HIH

Diego

Il 23/07/2021 12:29, Riccardo Sucapane ha scritto:
Hello everyone,
I am using Slurm as a workload manager on a system
with a master and 3 nodes.
The operating system used is the recent rocky linux 8.4
while for slurm, is used the version 20.11.8 taken from EPEL
repository.
Everything works correctly and when the system is started the command
"systemctl start slurmctld" works fine, but at boot the daemon
slurmctld does not start on the master machine, reporting a series of errors. Without reporting all the slurmctld.log the recurring error is the following:

[2021-07-23T09:58:01.932] error: get_addr_info: getaddrinfo() failed: Name or service not known
[2021-07-23T09:58:01.932] error: slurm_set_addr: Unable to resolve "blade01"
[2021-07-23T09:58:01.932] error: slurm_get_port: Address family '0' not supported
[2021-07-23T09:58:01.932] error: _set_slurmd_addr: failure on blade01


In this case I have set it in the slurm.conf file, for simplicity,
"AccountingStorageType=accounting_storage/none", but also using the
slurmdbd/mariadb support is all right with no problems, but slurmctld
still does not start on boot.
Also in the log reported blade01 is the hostname of one of the nodes.

I have already read some messages that reported a similar problem,
but none of the considerations I read helped me to overcome the problem.
Is there anyone who can help me find a solution?
Greetings to all
Riccardo

--
**********************************************************
Riccardo Sucapane
Dip. MEMOTEF - Sapienza Università di Roma
Via del Castro Laurenziano, 9 - 00161 - Roma
Tel. 06 4976 6846
**********************************************************

________________________________________________________
Le informazioni contenute in questo messaggio di posta elettronica sono strettamente riservate e indirizzate esclusivamente al destinatario. Si prega di non leggere, fare copia, inoltrare a terzi o conservare tale messaggio se non si è il legittimo destinatario dello stesso. Qualora tale messaggio sia stato ricevuto per errore, si prega di restituirlo al mittente e di cancellarlo permanentemente dal proprio computer. The information contained in this e mail message is strictly confidential and intended for the use of the addressee only.  If you are not the intended recipient, please do not read, copy, forward or store it on your computer. If you have received the message in error, please forward it back to the sender and delete it permanently from your computer system.
------------------------------------------------------------------------


Fai crescere i nostri giovani ricercatori
dona il 5 per mille alla Sapienza
*codice fiscale 80209930587*

--
Diego Zuccato
DIFA - Dip. di Fisica e Astronomia
Servizi Informatici
Alma Mater Studiorum - Università di Bologna
V.le Berti-Pichat 6/2 - 40127 Bologna - Italy
tel.: +39 051 20 95786

Reply via email to