Hello Rémy,

Indeed, libdrmaa was linked against the wrong version of libslurm:

ldd /usr/local/lib/libdrmaa.so.1.0.8
    linux-vdso.so.1 =>  (0x00007ffe17b8b000)
    libslurm.so.36 => /usr/local/lib/libslurm.so.36 (0x00007f237179f000)
    libdl.so.2 => /lib64/libdl.so.2 (0x00007f237159b000)
    libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f237137f000)
    libm.so.6 => /lib64/libm.so.6 (0x00007f237107d000)
    libresolv.so.2 => /lib64/libresolv.so.2 (0x00007f2370e63000)
    libc.so.6 => /lib64/libc.so.6 (0x00007f2370a95000)
    /lib64/ld-linux-x86-64.so.2 (0x00007f2371d7d000)

I just had to recompile drmaa to make it work again.

Somehow the error message I was getting was misleading. The fix was very simple actually.

Thanks for your help.

Best.

J.

Le 05/10/2023 à 10:15, Rémi Palancher a écrit :
Hello Julien,

Le mercredi 4 octobre 2023 à 19:04, Julien Rey 
<julien....@univ-paris-diderot.fr> a écrit :

Hello,

I did an upgrade of Slurm this week (20.11 to 21.08.8) and while
everything seems to be working with srun and sbatch commands, here is
what I get when I try to launch jobs from drmaa library:

…

I don't know if this is a slurm or a drmaa bug. So any advice would be
welcome.
Slurm daemons, binaries and libraries check the version of the plugins matches 
their own version at load time. The version of the plugins is bumped on every 
major version of Slurm (eg. 21.08) hence plugins compiled with 21.08 cannot be 
loaded by programs linked with libslurm from Slurm 20.11.

I suspect in this case DRMMA to be compiled and linked on libslurm from Slurm 
20.11 trying (and failing) to load newer plugins provided with Slurm 21.08.

Did you try to recompile your DRMMA layer against Slurm 21.08.8 headers and 
library?

--
Rémi Palancher
Rackslab: Open Source Solutions for HPC Operations
https://rackslab.io


--
Julien Rey

Plate-forme RPBS
Unité BFA - CMPLI
Université de Paris
tel: 01 57 27 83 95


Reply via email to