On 11-06-2020 14:24, navin srivastava wrote:
Hi Team,

when i am trying to start the slurmd process i am getting the below error.

2020-06-11T13:11:58.652711+02:00 oled3 systemd[1]: Starting Slurm node daemon... 2020-06-11T13:13:28.683840+02:00 oled3 systemd[1]: slurmd.service: Start operation timed out. Terminating. 2020-06-11T13:13:28.684479+02:00 oled3 systemd[1]: Failed to start Slurm node daemon. 2020-06-11T13:13:28.684759+02:00 oled3 systemd[1]: slurmd.service: Unit entered failed state. 2020-06-11T13:13:28.684917+02:00 oled3 systemd[1]: slurmd.service: Failed with result 'timeout'. 2020-06-11T13:15:01.437172+02:00 oled3 cron[8094]: pam_unix(crond:session): session opened for user root by (uid=0)

Slurm version is 17.11.8

The server and slurm is runningĀ from long time and we have not made any changesĀ but today when i am starting it is giving this error message.
Any idea what could be wrong here.

Which OS do you run this ancient Slurm version on? There could be many reasons why slurmd refuses to start, such as network, DNS, firewall, etc.

You should check the log file in /var/log/slurm/

You could start the slurmd from the command line, adding one or more -v for verbose logging:

$ slurmd -v -v

/Ole

Reply via email to