Hi All,
We have a flotilla of GPUs all protected by cgroups.
But occasionally we have users who _must_ ssh into the node.
Of course if they ssh in, then the cgroup protection doesn’t work (yes -
there’s a slurm plugin
to tie an ssh session to a cgroup, but that seems more problematic with 8-GP
Hi,
maybe I missed it, but what does squeue say in the reason field for
your pending jobs that you expect to slip in?
Is your partition maybe configured for exclusive node access, e.g. by
setting `OverSubscribe=EXCLUSIVE´?
Best regards
Jürgen
--
Jürgen Salk
Scientific Software & Compute Ser