This thread on the forums may help:

https://groups.google.com/g/slurm-users/c/YB55Ru9rvD4


It looks like you have something on your network with an older version of slurm 
installed. I'd check the Slurm version installed on your compute nodes and 
controllers.

The recommended approach to upgrading is to upgrade the SlurmDB first, then the 
controllers, then the compute nodes. More info here:

https://slurm.schedmd.com/quickstart_admin.html#upgrade

Regards
--
Mick Timony
Senior DevOps Engineer
Harvard Medical School
--

________________________________
From: slurm-users <slurm-users-boun...@lists.schedmd.com> on behalf of Wadud 
Miah <w.m...@soton.ac.uk>
Sent: Thursday, September 8, 2022 10:47 AM
To: slurm-users@lists.schedmd.com <slurm-users@lists.schedmd.com>
Subject: [slurm-users] Upgrading SLURM from 18 to 20.11.9

Hi,

I am attempting to upgrade from SLURM 18 to 20.11.9 and when I attempt to start 
slurmdbd (version 20.11.9), I get the following error messages in 
/var/log/slurm/slurmdbd.log:

[2022-09-08T15:45:11.115] slurmdbd version 20.11.9 started
[2022-09-08T15:45:23.001] error: unpack_header: protocol_version 8448 not 
supported
[2022-09-08T15:33:57.001] unpacking header
[2022-09-08T15:33:57.001] error: destroy_forward: no init
[2022-09-08T15:33:57.001] error: slurm_unpack_received_msg: Message receive 
failure
[2022-09-08T15:33:57.011] error: CONN:11 Failed to unpack SLURM_PERSIST_INIT 
message

Any help will be greatly appreciated.

Regards,

----------
Wadud Miah
Research Computing Support
University of Southampton

Reply via email to