Slurm version 23.02.07
If I have a QoS defined that has a set number of say GPU devices set in the
GrpTRES. Is there an easy way to generate a list of how much of the
defined quota is allocated or conversely un-allocated?
e.g.:
Name|Priority|GraceTime|Preempt|PreemptExemptTime|PreemptMode|Flags|
specified in the GrpTRES set to a limit of 0 by default - or
are they unlimited?
In other words is it sufficient to set a limit on the one specific gres
device, or do I also have to explicitly set a limit of 0 on all other gres
devices?
Thanks,
-Alastair Neil
invalid field requested: "reason"
On Tue, 12 May 2020 at 16:47, Steven Dick wrote:
> What do you get from
>
> sacct -o jobid,elapsed,reason,exit -j 533900,533902
>
> On Tue, May 12, 2020 at 4:12 PM Alastair Neil
> wrote:
> >
> > The log is continuous a
an slurm liked, so it took the node offline and killed
> everything on it.
>
> On Mon, May 11, 2020 at 12:55 PM Alastair Neil
> wrote:
> >
> > Hi there,
> >
> > We are using slurm 18.08 and had a weird occurrence over the weekend. A
> user canceled one
Hi there,
We are using slurm 18.08 and had a weird occurrence over the weekend. A
user canceled one of his jobs using scancel, and two additional jobs of the
user running on the same node were killed concurrently. The jobs had no
dependency, but they were all allocated 1 gpu. I am curious to kno
Hi there,
Slurm version 18.08
I am trying to find out if there is a way to add a specific gres, in this
case a GPU to a reservation? I think I can reserve a portion of a node
that has a specific gres quantity attached but I cannot figure out how to
reserve the gres, so I cannot guarantee that i