[slurm-users] Is there a way to list allocated/unallocated resources defined in a QoS?

2024-02-06 Thread Alastair Neil via slurm-users
Slurm version 23.02.07 If I have a QoS defined that has a set number of say GPU devices set in the GrpTRES. Is there an easy way to generate a list of how much of the defined quota is allocated or conversely un-allocated? e.g.: Name|Priority|GraceTime|Preempt|PreemptExemptTime|PreemptMode|Flags|

[slurm-users] what is the default limit behaviour for a qos with GrpTRES defined

2023-11-27 Thread Alastair Neil
specified in the GrpTRES set to a limit of 0 by default - or are they unlimited? In other words is it sufficient to set a limit on the one specific gres device, or do I also have to explicitly set a limit of 0 on all other gres devices? Thanks, -Alastair Neil

Re: [slurm-users] additional jobs killed by scancel.

2020-05-13 Thread Alastair Neil
invalid field requested: "reason" On Tue, 12 May 2020 at 16:47, Steven Dick wrote: > What do you get from > > sacct -o jobid,elapsed,reason,exit -j 533900,533902 > > On Tue, May 12, 2020 at 4:12 PM Alastair Neil > wrote: > > > > The log is continuous a

Re: [slurm-users] additional jobs killed by scancel.

2020-05-12 Thread Alastair Neil
an slurm liked, so it took the node offline and killed > everything on it. > > On Mon, May 11, 2020 at 12:55 PM Alastair Neil > wrote: > > > > Hi there, > > > > We are using slurm 18.08 and had a weird occurrence over the weekend. A > user canceled one

[slurm-users] additional jobs killed by scancel.

2020-05-11 Thread Alastair Neil
Hi there, We are using slurm 18.08 and had a weird occurrence over the weekend. A user canceled one of his jobs using scancel, and two additional jobs of the user running on the same node were killed concurrently. The jobs had no dependency, but they were all allocated 1 gpu. I am curious to kno

[slurm-users] creating a reservation for a gres resources e.g. GPU?

2020-04-22 Thread Alastair Neil
Hi there, Slurm version 18.08 I am trying to find out if there is a way to add a specific gres, in this case a GPU to a reservation? I think I can reserve a portion of a node that has a specific gres quantity attached but I cannot figure out how to reserve the gres, so I cannot guarantee that i