On 06/13/2018 01:59 PM, Prentice Bisbal wrote:
In my environment, we have several partitions that are 'general
access', with each partition providing different hardware resources
(IB, large mem, etc). Then there are other partitions that are for
specific departments/projects. Most of this configuration is
historical, and I can't just rearrange the partition layout, etc,
which would allow Slurm to apply its own logic to redirect jobs to
the appropriate nodes.
For the general access partitions, I've decided to apply some of this
logic in my job_submit.lua script. This logic would look at some of
the job specifications and change the QOS/Partition for the job as
appropriate. One thing I'm trying to do is have large memory jobs be
assigned to my large memory partition, which is named mque for
historical reasons.
To do this, I have added the following logic to my job_submit.lua script:
    if job_desc.pn_min_mem > 65536 then
       slurm.user_msg("NOTICE: Partition switched to mque due to memory requirements.")
       job_desc.partition = 'mque'
       job_desc.qos = 'mque'
       return slurm.SUCCESS
    end
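For context, that fragment sits inside the slurm_job_submit() callback. A
minimal sketch of the surrounding script follows; the 65536 MB threshold and
the mque partition/QOS names are our site-specific values, and the nil check
is only defensive:

    function slurm_job_submit(job_desc, part_list, submit_uid)
       -- pn_min_mem is the per-node memory request (MB) when --mem is used;
       -- the nil check is defensive in case the field is unset.
       if job_desc.pn_min_mem ~= nil and job_desc.pn_min_mem > 65536 then
          slurm.user_msg("NOTICE: Partition switched to mque due to memory requirements.")
          job_desc.partition = 'mque'
          job_desc.qos = 'mque'
       end
       return slurm.SUCCESS
    end

    function slurm_job_modify(job_desc, job_rec, part_list, modify_uid)
       return slurm.SUCCESS
    end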
This works when --mem is specified, but it doesn't seem to work when
--mem-per-cpu is specified. What is the best way to check this when
--mem-per-cpu is specified instead? Logically, one would have to calculate

    mem per node = ntasks_per_node * ( ntasks_per_core / min_mem_per_cpu )

Is that correct? If so, are there any flaws in the logic/variable names
above? Also, is this quantity automatically calculated in Slurm by a
variable that is accessible by job_submit.lua at this point, or do I
need to calculate this myself?
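For anyone attempting it, a rough sketch of such a calculation in
job_submit.lua might look like the following. The min_mem_per_cpu,
ntasks_per_node, and cpus_per_task field names are assumptions to verify
against the job descriptor fields your Slurm version's job_submit/lua plugin
actually exposes, and it estimates memory per node as
mem-per-cpu * cpus-per-task * tasks-per-node rather than using the division
above:

    -- Rough sketch: estimate the per-node memory request (in MB) when the
    -- job was submitted with --mem-per-cpu instead of --mem.  Field names
    -- other than pn_min_mem are assumptions; check your Slurm version.
    local function estimated_mem_per_node(job_desc)
       -- --mem case: pn_min_mem is already a per-node figure.
       if job_desc.pn_min_mem ~= nil and job_desc.pn_min_mem > 0 then
          return job_desc.pn_min_mem
       end
       -- --mem-per-cpu case: scale by the CPUs expected on each node.
       local mem_per_cpu = job_desc.min_mem_per_cpu
       if mem_per_cpu == nil or mem_per_cpu <= 0 then
          return nil   -- no memory request to reason about
       end
       -- Unset counts may come back as nil, 0, or a NO_VAL sentinel
       -- depending on the version; treat anything implausible as 1.
       local tasks_per_node = job_desc.ntasks_per_node or 1
       local cpus_per_task  = job_desc.cpus_per_task or 1
       if tasks_per_node < 1 or tasks_per_node > 4096 then tasks_per_node = 1 end
       if cpus_per_task  < 1 or cpus_per_task  > 4096 then cpus_per_task  = 1 end
       return mem_per_cpu * cpus_per_task * tasks_per_node
    end

    -- Usage inside slurm_job_submit():
    --   local mem = estimated_mem_per_node(job_desc)
    --   if mem ~= nil and mem > 65536 then
    --      job_desc.partition = 'mque'
    --      job_desc.qos = 'mque'
    --   end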
I've given up on calculating mem per node when --mem-per-cpu is
specified. I was hoping to do this to protect my users from themselves,
but the more I think about this, the more this looks like a fool's errand.
Prentice