Hi Riccardo.
I had a similar problem (slurm.conf is served from an NFS share). I just
modified the slurmd unit:
# systemctl edit slurmd
[Unit]
Requires=network-online.target
After=home.mount
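
For slurmctld on the master, where the log shows node hostnames not
resolving at boot, the same approach might look like the sketch below.
It is only a sketch, not tested on your system: the helper name
write_dropin is made up for illustration, and the default path is my
assumption of where "systemctl edit slurmctld" stores its override.
Wants= pulls in network-online.target and After= makes slurmctld wait
for it.

```shell
#!/bin/sh
# Sketch: write a systemd drop-in that orders slurmctld after the
# network is online. write_dropin is a hypothetical helper name.
# The default directory is assumed to be the one that
# "systemctl edit slurmctld" would use.
write_dropin() {
    dir="${1:-/etc/systemd/system/slurmctld.service.d}"
    mkdir -p "$dir" || return 1
    # Wants= pulls the target in, After= delays the start until it is reached.
    cat > "$dir/override.conf" <<'EOF'
[Unit]
Wants=network-online.target
After=network-online.target
EOF
}
```

As root, something like "write_dropin && systemctl daemon-reload" would
apply it. network-online.target only waits if a wait-online service is
enabled; if NetworkManager manages the interfaces (an assumption about
your setup), that would be NetworkManager-wait-online.service.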
HIH
Diego
On 23/07/2021 12:29, Riccardo Sucapane wrote:
Hello everyone,
I am using Slurm as the workload manager on a system
with a master and 3 nodes.
The operating system is the recent Rocky Linux 8.4,
and Slurm is version 20.11.8, taken from the EPEL
repository.
Everything works correctly, and once the system is up the command
"systemctl start slurmctld" works fine. At boot, however, the slurmctld
daemon does not start on the master machine and reports a series of
errors.
Without quoting the whole slurmctld.log, the recurring error is the
following:
[2021-07-23T09:58:01.932] error: get_addr_info: getaddrinfo() failed: Name or service not known
[2021-07-23T09:58:01.932] error: slurm_set_addr: Unable to resolve "blade01"
[2021-07-23T09:58:01.932] error: slurm_get_port: Address family '0' not supported
[2021-07-23T09:58:01.932] error: _set_slurmd_addr: failure on blade01
In this case I have set, for simplicity,
"AccountingStorageType=accounting_storage/none" in the slurm.conf file.
Using the slurmdbd/MariaDB support also works without problems, but
slurmctld still does not start at boot.
Note that blade01, reported in the log, is the hostname of one of the nodes.
I have already read some messages that reported a similar problem,
but none of the suggestions in them helped me solve it.
Can anyone help me find a solution?
Greetings to all
Riccardo
--
**********************************************************
Riccardo Sucapane
Dip. MEMOTEF - Sapienza Università di Roma
Via del Castro Laurenziano, 9 - 00161 - Roma
Tel. 06 4976 6846
**********************************************************
--
Diego Zuccato
DIFA - Dip. di Fisica e Astronomia
Servizi Informatici
Alma Mater Studiorum - Università di Bologna
V.le Berti-Pichat 6/2 - 40127 Bologna - Italy
tel.: +39 051 20 95786