Hi Rodrigo,
We had indeed overlooked this. The problem is that in general our jobs
need more than 2 days of resources, which is why we set the wall time in
the batch scripts to the maximum wall time allowed by the partition.
One thing we could try is to set the wall time to ~46h for the "light"
jobs in the batch scripts and leave 48h for the "heavy" jobs, so that
not all jobs have the same time limit.
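Concretely, in the batch scripts that would just be something like (a
rough sketch, using the values I mention above):

    # "light" jobs: a bit below the partition maximum
    #SBATCH --time=46:00:00

    # "heavy" jobs: the full 48h partition maximum
    #SBATCH --time=48:00:00

so that the backfill scheduler actually sees different time limits.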
Configuring a node list for "light" and "heavy" jobs could do the trick
(sketched below). Two things that could be a problem then are (i) even
"heavy" jobs with very low priority would get access to resources at
the expense of "light" jobs with higher priority, and (ii) regular
manual intervention would be needed. But maybe there is no other
solution.
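To be concrete about what I mean (just a sketch, the node names and
counts are made up):

    # in the "light" job scripts: keep two nodes free for the "heavy" jobs
    #SBATCH --exclude=node[01-02]

    # in a 2-node "heavy" job script: request exactly those two nodes
    #SBATCH --nodes=2
    #SBATCH --nodelist=node[01-02]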
Thanks a lot for your input!
Best,
Jeremy
On 2022-01-19 01:46, Rodrigo Santibáñez wrote:
Hi Jeremy,
If all jobs have the same time limit, backfill is impossible. The
documentation says: "Effectiveness of backfill scheduling is dependent
upon users specifying job time limits, otherwise all jobs will have the
same time limit and backfilling is impossible". I don't know how to
overcome that...
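You can check what is currently configured on your side with something
like:

    # scheduler type and the backfill (bf_*) parameters
    scontrol show config | grep -i -E 'schedulertype|schedulerparameters'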
However, without changing SchedulerType, you could hold the pending
jobs except for the job you want to execute, then release all jobs once
the desired job is allocated. Alternatively, you could define a node or
list of nodes available to all jobs while excluding those nodes for the
job of interest, then remove that configuration once the latter is
allocated. I preferred the second option because both the "heavy" job
and the "light" jobs get allocated, and I don't have to watch the queue
outside office hours (again, this is easier to do on a lightly utilized
cluster).
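In practice it is just a few scontrol calls, roughly like this (job ids
and node names are placeholders):

    # first option: hold the pending jobs, release them once the
    # "heavy" job is allocated
    scontrol hold <jobid_list>
    scontrol release <jobid_list>

    # second option: exclude the nodes meant for the "heavy" job from
    # the pending "light" jobs
    scontrol update JobId=<jobid> ExcNodeList=node[01-02]
    # clear it again afterwards with a blank value
    scontrol update JobId=<jobid> ExcNodeList=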
About "PLANNED", I wasn't aware, and it is a feature of SLURM 21.08.
Could be that why you don't see it in your cluster?
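A quick way to check:

    # Slurm version on the cluster
    sinfo -V
    # node names and their (long) state; look for "planned"
    sinfo -o "%N %T"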
Best,
On Mon, Jan 17, 2022 at 2:02 PM Jérémy Lapierre
<jeremy.lapie...@uni-saarland.de> wrote:
Hi Rodrigo and Rémi,
I had a similar behavior a long time ago, and I decided to set
SchedulerType=sched/builtin to empty X nodes of jobs and execute that
high-priority job requesting more than one node. It is not ideal, but
the cluster has a low load, so a user who requests more than one node
doesn't delay the execution of others' jobs too much.
I don't think this would be ideal in our case as we have heavy loads.
Also, I'm not sure if you mean that we should switch to
SchedulerType=sched/builtin permanently or just for the time needed for
the problematic jobs to be allocated? Also, from our experience on
another cluster, we think Slurm should normally reserve resources.
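If it were only temporary, I guess it would mean something like this on
the controller side (just my understanding of it, we have not tried it,
and it assumes slurmctld runs under systemd):

    # in slurm.conf, temporarily switch:
    #   SchedulerType=sched/builtin    (instead of sched/backfill)
    # a scheduler change needs a slurmctld restart, not just
    # "scontrol reconfigure"
    systemctl restart slurmctld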
Backfilling doesn't delay the scheduled start time of higher priority
jobs, but at least they must have a scheduled start time.
Did you check the start time of your job pending with the Resources
reason? e.g. with `scontrol show job <id> | grep StartTime`.
Yes, the scheduled start time has been checked as well, and this time
is updated over time such that jobs asking for 1/4 of a node can run on
a freshly freed quarter of a node. This is why I'm saying that the jobs
asking for several nodes (tested with 2 nodes here) are pending
forever. It is like Slurm never wants to have unused resources (which
also makes sense, but how can we satisfy "heavy" resource requests
then?). On another cluster using Slurm, I know that Slurm reserves
nodes and the state of those reserved nodes becomes "PLANNED" (or
plnd); this way, jobs requesting more resources than are available at
the time of submission can later be satisfied. This never happens on
the cluster which is causing issues.
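For what it is worth, this is the kind of thing we keep re-checking
(the job id is just an example):

    # expected start time and pending reason of the 2-node job
    squeue --start -j 123456
    scontrol show job 123456 | grep -E 'StartTime|Reason'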
Sometimes Slurm is unable to determine the start time of a pending job.
One typical reason is the absence of a time limit on the running jobs.
In this case Slurm is unable to determine when the running jobs are
over, when the next highest priority job can start, and eventually
whether lower priority jobs actually delay higher priority jobs.
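You can quickly check that with something like:

    # time limit (%l) and remaining time (%L) of the running jobs
    squeue -t RUNNING -o "%.10i %.10u %.12l %.12L"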
Yes, we always set the time limit of our jobs to the max time limit
allowed by the partition.
Thanks for your help,
Jeremy