Hi.
Maybe your jobs are requesting RAM (or other resources) that are no
longer available on the first node once 6 jobs are running there?
Try checking with scontrol show node.
BYtE,
Diego
On 06/08/2021 08:52, Jack Chen wrote:
I'm using Slurm 15.08.11. When I submit several 1-GPU jobs, Slurm doesn't
allocate nodes using a compact strategy. Does anyone know how to solve
this? Will upgrading to the latest Slurm version help?
For example, there are two nodes, A and B, with 8 GPUs per node. I
submitted eight 1-GPU jobs; Slurm allocated the first 6 jobs on node A
and the last 2 jobs on node B. When I then submit one job that needs
8 GPUs, it stays pending because of GPU fragmentation: node A has 2 idle
GPUs and node B has 6 idle GPUs.
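
To make the arithmetic in this example concrete, here is a small Python
sketch of the situation. It is only a toy model, not how Slurm schedules:
the can_place helper and the dictionaries are hypothetical names, and it
assumes the 8-GPU job must fit on a single node.

```python
def can_place(free_gpus, request):
    """Return the first node with enough idle GPUs for the request, else None."""
    for node, idle in free_gpus.items():
        if idle >= request:
            return node
    return None


# Numbers from the example: two nodes with 8 GPUs each.
total = {"A": 8, "B": 8}

# Observed placement: 6 single-GPU jobs on node A, 2 on node B.
used = {"A": 6, "B": 2}
free = {node: total[node] - used[node] for node in total}

# 8 GPUs are idle in total, but no single node has 8 idle GPUs.
print(free, sum(free.values()), can_place(free, 8))

# Compact placement: all 8 single-GPU jobs on node A leaves node B empty.
used_compact = {"A": 8, "B": 0}
free_compact = {node: total[node] - used_compact[node] for node in total}
print(free_compact, can_place(free_compact, 8))
```

With the observed placement the idle GPUs add up to 8 but are split 2 + 6,
so the single-node request cannot be satisfied. With a compact placement
node B would stay fully idle and the same request would fit.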
Thanks in advance!
--
Diego Zuccato
DIFA - Dip. di Fisica e Astronomia
Servizi Informatici
Alma Mater Studiorum - Università di Bologna
V.le Berti-Pichat 6/2 - 40127 Bologna - Italy
tel.: +39 051 20 95786