[slurm-users] One time override to force run job

2019-09-02 Thread Tina Fora
Hello, is there a way to force a job to run that is being held back for QOSGrpCpuLimit? This is coming from a QOS that we have in place. For the most part it works great, but every once in a while we have free nodes that are idle and I'd like to force the job to run. Tina

[slurm-users] How can jobs request a minimum available (free) TmpFS disk space?

2019-09-02 Thread Ole Holm Nielsen
We have some users requesting that a certain minimum size of the *Available* (i.e., free) TmpFS disk space should be present on nodes before a job should be considered by the scheduler for a set of nodes. I believe that the "sbatch --tmp=size" option merely refers to the TmpFS file system *Siz

[slurm-users] Power/Cloud Plugin - Race Condition after Node Start - Wrong Job State

2019-09-02 Thread Felix Wolfheimer
Just stumbled on an issue which kicks in occasionally when Slurm starts/creates instances using the power/cloud plugin. Here is what happens: I'm using the Slurm Power/Cloud plugin to create compute instances on demand. Occasionally it happens that I run into the following situation when new inst