Hello everyone, I am using Slurm as a workload manager on a system with a master and 3 nodes. The operating system used is the recent rocky linux 8.4 while for slurm, is used the version 20.11.8 taken from EPEL repository. Everything works correctly and when the system is started the command "systemctl start slurmctld" works fine, but at boot the daemon slurmctld does not start on the master machine, reporting a series of errors. Without reporting all the slurmctld.log the recurring error is the following:
[2021-07-23T09:58:01.932] error: get_addr_info: getaddrinfo() failed: Name or service not known [2021-07-23T09:58:01.932] error: slurm_set_addr: Unable to resolve "blade01" [2021-07-23T09:58:01.932] error: slurm_get_port: Address family '0' not supported [2021-07-23T09:58:01.932] error: _set_slurmd_addr: failure on blade01 In this case I have set it in the slurm.conf file, for simplicity, "AccountingStorageType=accounting_storage/none", but also using the slurmdbd/mariadb support is all right with no problems, but slurmctld still does not start on boot. Also in the log reported blade01 is the hostname of one of the nodes. I have already read some messages that reported a similar problem, but none of the considerations I read helped me to overcome the problem. Is there anyone who can help me find a solution? Greetings to all Riccardo -- ********************************************************** Riccardo Sucapane Dip. MEMOTEF - Sapienza Università di Roma Via del Castro Laurenziano, 9 - 00161 - Roma Tel. 06 4976 6846 ********************************************************** -- ________________________________________________________ Le informazioni contenute in questo messaggio di posta elettronica sono strettamente riservate e indirizzate esclusivamente al destinatario. Si prega di non leggere, fare copia, inoltrare a terzi o conservare tale messaggio se non si è il legittimo destinatario dello stesso. Qualora tale messaggio sia stato ricevuto per errore, si prega di restituirlo al mittente e di cancellarlo permanentemente dal proprio computer. The information contained in this e mail message is strictly confidential and intended for the use of the addressee only. If you are not the intended recipient, please do not read, copy, forward or store it on your computer. If you have received the message in error, please forward it back to the sender and delete it permanently from your computer system. -- Fai crescere i nostri giovani ricercatori dona il 5 per mille alla Sapienza *codice fiscale 80209930587*