Re: [slurm-users] slurm node weights

2019-09-05 Thread Merlin Hartley
I believe this is so that small jobs will naturally go on older, slower nodes first - leaving the bigger,better ones for jobs that actually need them. Merlin -- Merlin Hartley IT Support Engineer MRC Mitochondrial Biology Unit University of Cambridge Cambridge, CB2 0XY United Kingdom > O

Re: [slurm-users] maximum size of array jobs

2019-02-26 Thread Merlin Hartley
<https://github.com/SchedMD/slurm/commit/09c13fb292a4a6a56b4078de840aae0d4db70309> -- Merlin Hartley Computer Officer MRC Mitochondrial Biology Unit University of Cambridge Cambridge, CB2 0XY United Kingdom > On 26 Feb 2019, at 14:27, Jeffrey Frey wrote: > > Also see "

Re: [slurm-users] How to get the CPU usage of history jobs at each compute node?

2019-02-15 Thread Merlin Hartley
using sacct [1] - assuming you have accounting [2] enabled: sacct -j Hope this helps! Merlin [1] https://slurm.schedmd.com/sacct.html <https://slurm.schedmd.com/sacct.html> [2] https://slurm.schedmd.com/accounting.html <https://slurm.schedmd.com/accounting.html> -- Me

Re: [slurm-users] How to request ONLY one CPU instead of one socket or one node?

2019-02-15 Thread Merlin Hartley
Seems like you aren't specifying a --mem option, so the default would be to ask for a whole-node’s worth of RAM thus you would use the whole node for each job. Hope this is useful! Merlin -- Merlin Hartley Computer Officer MRC Mitochondrial Biology Unit University of Cambridge Cambridge

Re: [slurm-users] Reserve CPUs/MEM for GPUs

2019-02-15 Thread Merlin Hartley
You could instead only allow the cpu partition to use 192G RAM and 20 CPU on those nodes... -- Merlin Hartley > On 13 Feb 2019, at 07:38, Quirin Lohr wrote: > > Hi all, > > we have a slurm cluster running on nodes with 2x18 cores, 256GB RAM and 8 > GPUs. Is there a way

Re: [slurm-users] jobs stuck in ReqNodeNotAvail,

2017-11-29 Thread Merlin Hartley
damn autocorrect - I meant: # scontrol show job 6982 -- Merlin Hartley Computer Officer MRC Mitochondrial Biology Unit Cambridge, CB2 0XY United Kingdom > On 29 Nov 2017, at 16:08, Merlin Hartley > wrote: > > Can you give us the output of > # control show job 6982 > &

Re: [slurm-users] jobs stuck in ReqNodeNotAvail,

2017-11-29 Thread Merlin Hartley
Can you give us the output of # control show job 6982 Could be an issue with requesting too many CPUs or something… Merlin -- Merlin Hartley Computer Officer MRC Mitochondrial Biology Unit Cambridge, CB2 0XY United Kingdom > On 29 Nov 2017, at 15:21, Christian Anthon wrote: > > H

Re: [slurm-users] Query about Compute + GPUs

2017-11-21 Thread Merlin Hartley
Could you give us your submission command? It may be that you are requesting the wrong partition - i.e. relying on the default partition selection… try with “--partition cpu” M -- Merlin Hartley Computer Officer MRC Mitochondrial Biology Unit Cambridge, CB2 0XY United Kingdom > On 21