Re: [slurm-users] 20.11.1 on Cray: job_submit.lua: SO loaded on CtlD restart: script skipped when job submitted

2020-12-17 Thread Kevin Buckley
On 2020/12/17 11:34, Chris Samuel wrote: On 16/12/20 6:21 pm, Kevin Buckley wrote: The skip is occurring, in src/lua/slurm_lua.c, because of this trap That looks right to me, that's Doug's code which is checking whether the file has been updated since slurmctld last read it in. If it has then

[slurm-users] exclusive use of GPU

2020-12-17 Thread Sartoratti Lorenzo
Hi all, we are using Slurm with a series of generic computing servers. We would like to add a server with 10 GPUs. Some of these GPUs are owned by specific users because they were purchased with personal research funds; the other GPUs would be in use by all Slurm users. Is it possible to configu

Re: [slurm-users] [EXT] slurm/munge problem: invalid credentials

2020-12-17 Thread Olaf Gellert
Hi Sean, Sean Crosby wrote on 16.12.20 10:09: Check the firewalls between your compute node and the Slurm controller to make sure that they can contact each other. Slurmctld needs to contact the SlurmdPort (default 6818), and slurmd needs to contact the SlurmctldPort (default 6817). Also the o
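The firewall check quoted above can be approximated with a plain TCP connection test. This is a minimal sketch, not part of the original thread: the helper name `can_connect` is hypothetical, the port numbers are the defaults quoted in the message, and a site that sets SlurmctldPort or SlurmdPort differently in slurm.conf would need to substitute its own values. A successful connection only shows that the port is reachable; it says nothing about whether munge credentials are valid.

```python
import socket

# Default ports as quoted in the message above; adjust if slurm.conf overrides them.
SLURMCTLD_PORT = 6817  # slurmd on a compute node connects to this port on the controller
SLURMD_PORT = 6818     # slurmctld on the controller connects to this port on each compute node


def can_connect(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds.

    Hypothetical helper for illustration; it tests reachability only.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False
```

Run from a compute node, `can_connect("<controller hostname>", SLURMCTLD_PORT)` tests the path to slurmctld; run from the controller, `can_connect("<compute node hostname>", SLURMD_PORT)` tests the reverse direction. Both directions need to succeed, since each daemon initiates connections to the other.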