[slurm-users] [slurm 17.02] select/cray plugin from non-crays

2018-01-10 Thread Andrew Elwell
Hi folks, We've just upgraded to slurm 17.02.9 (native) on our Crays, but can't get sinfo to work against them anymore from a non-Cray: "sinfo: error: Cluster 'galaxy' has an unknown select plugin_id 108" On the Crays we have aelwell@galaxy-int:~/testjobs/native$ grep -i select /etc/opt/slurm/slurm.co

[slurm-users] logging question

2018-01-10 Thread Derek Yarnell
Hi, So we have been trying to track down GRES/cgroup issues that we have been having with multi-GPU space sharing. We were running 17.02.2 and upgraded to 17.02.9, and it looks like some of the messages were changed from debug2 to info level based on the diff of src/plugins/task/cgroup/task_cg

[slurm-users] long wait until sacct has all data available

2018-01-10 Thread Stefan Bienert
Hello, I'm new to SLURM and on my second day stumbled upon something weird. When I use sacct, very often I get 'Unknown' for the qos used. After a while the qos is set to what I actually set it to for the job. Specifically, I can run $ sacct -j 22047339 --parsable2 -o QOS QOS Unknown Wait 10 m