QOS Group TRES limits apply to associations.
If I recall correctly, an association is a (user,account,partition,cluster)

On Fri, Oct 21, 2022 at 9:46 AM Matthew R. Baney <mba...@umd.edu> wrote:
>
> Hello,
>
> I have noticed that jobs submitted to non-preemptable partitions (PreemptType 
> = preempt/partition_prio and PreemptMode = REQUEUE) under accounts with 
> GrpTRES limits will become pending with AssocGrpGRES as the reason when the 
> account is up against the relevant limit, even when there are other running 
> jobs on preemptable partitions under the same account and when the pending 
> jobs have higher priority. The GRES in consideration are GPUs.
>
> It seems like the scheduler is checking to see if the pending jobs are within 
> the GRES limit for the account before considering if any of the other jobs in 
> the account are running on preemptable partitions. In some specific observed 
> cases, even preempting a single job running in a preemptable partition would 
> allow the non-preemptable partition job to run (based on GRES freed up from 
> preemption).
>
> Is it possible to reverse the order in which these checks are evaluated?
>
> Best,
> Matthew
>
> --
> Matthew Baney
> UMIACS Technical Staff
> mba...@umd.edu | (301) 405-6756
> University of Maryland Institute for Advanced Computer Studies
> 3154 Brendan Iribe Center
> 8125 Paint Branch Dr.
> College Park, MD 20742

Reply via email to