Hi.

Maybe your jobs are requesting more RAM (or other resources) than is still available on the first node once 6 other jobs are running there?

Try checking with scontrol show node.

BYtE,
 Diego

On 06/08/2021 08:52, Jack Chen wrote:
I'm using Slurm 15.08.11. When I submit several 1-GPU jobs, Slurm doesn't allocate nodes using a compact strategy. Does anyone know how to solve this? Will upgrading to the latest Slurm version help?

For example, there are two nodes, A and B, with 8 GPUs per node. I submitted eight 1-GPU jobs; Slurm allocated the first 6 jobs on node A and the last 2 jobs on node B. When I then submit one job requesting 8 GPUs, it stays pending because of GPU fragmentation: node A has 2 idle GPUs and node B has 6 idle GPUs.
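
The arithmetic of this example can be checked with a small toy model. This is only a sketch: it does not reproduce Slurm's real scheduling logic, and the function name place_jobs and the first-fit rule are assumptions made for illustration. It compares the free GPUs left by the observed 6 + 2 split with what a compact (first-fit) placement would leave.

```python
# Toy model of the scenario above: two nodes, A and B, with 8 GPUs each.
# NOTE: place_jobs is a hypothetical helper, not part of Slurm; it uses a
# plain first-fit rule purely to illustrate the fragmentation arithmetic.

def place_jobs(free, requests, order):
    """Place each GPU request on the first node in `order` with enough free GPUs.

    `free` maps node name -> free GPU count and is updated in place.
    Returns one entry per request: the chosen node name, or None if the
    request cannot be satisfied by any single node (the job would pend).
    """
    placements = []
    for need in requests:
        chosen = None
        for node in order:
            if free[node] >= need:
                free[node] -= need
                chosen = node
                break
        placements.append(chosen)
    return placements


# State reported in the post: 6 one-GPU jobs on A, 2 on B.
observed = {"A": 8 - 6, "B": 8 - 2}
observed_result = place_jobs(dict(observed), [8], ["A", "B"])
print(observed, observed_result)   # no single node has 8 free GPUs

# Compact placement: fill node A completely before touching node B.
compact = {"A": 8, "B": 8}
place_jobs(compact, [1] * 8, ["A", "B"])
compact_after_small_jobs = dict(compact)
compact_result = place_jobs(compact, [8], ["A", "B"])
print(compact_after_small_jobs, compact_result)
```

In the observed state no single node can hold the 8-GPU job, while under the compact placement node B stays completely free and can take it.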

Thanks in advance!

--
Diego Zuccato
DIFA - Dip. di Fisica e Astronomia
Servizi Informatici
Alma Mater Studiorum - Università di Bologna
V.le Berti-Pichat 6/2 - 40127 Bologna - Italy
tel.: +39 051 20 95786
