Re: [slurm-users] Kill job when child process gets OOM-killed

2021-06-10 Thread Arthur Gilly
To: Slurm User Community List Subject: Re: [slurm-users] Kill job when child process gets OOM-killed Yep, those are reasons not to create the array of 100k jobs. >From https://www.mail-archive.com/slurm-users@lists.schedmd.com/msg04092.html >, deeper in the thread from one o

Re: [slurm-users] Kill job when child process gets OOM-killed

2021-06-09 Thread Renfro, Michael
on behalf of Arthur Gilly Date: Tuesday, June 8, 2021 at 10:00 PM To: 'Slurm User Community List' Subject: Re: [slurm-users] Kill job when child process gets OOM-killed External Email Warning This email originated from outside the university. Please use caution when op

Re: [slurm-users] Kill job when child process gets OOM-killed

2021-06-08 Thread Arthur Gilly
: slurm-users On Behalf Of Renfro, Michael Sent: Tuesday, 8 June 2021 20:12 To: Slurm User Community List Subject: Re: [slurm-users] Kill job when child process gets OOM-killed Any reason *not* to create an array of 100k jobs and let the scheduler just handle things? Current versions of Slurm

Re: [slurm-users] Kill job when child process gets OOM-killed

2021-06-08 Thread Renfro, Michael
on behalf of Arthur Gilly Date: Tuesday, June 8, 2021 at 4:12 AM To: 'Slurm User Community List' Subject: Re: [slurm-users] Kill job when child process gets OOM-killed External Email Warning This email originated from outside the university. Please use caution when opening attachm

Re: [slurm-users] Kill job when child process gets OOM-killed

2021-06-08 Thread Arthur Gilly
MGU) - From: slurm-users On Behalf Of Loris Bennett Sent: Tuesday, 8 June 2021 16:05 To: Slurm User Community List Subject: Re: [slurm-users] Kill job when child process gets OOM-killed Dear Arthur, Arthur Gilly mailto:arthur.gi...@helmholtz-muenchen.de> > writes: > Dear

Re: [slurm-users] Kill job when child process gets OOM-killed

2021-06-08 Thread Loris Bennett
Dear Arthur, Arthur Gilly writes: > Dear Slurm users, > > > > I am looking for a SLURM setting that will kill a job immediately when any > subprocess of that job hits an OOM limit. Several posts have touched upon > that, e.g: > https://www.mail-archive.com/slurm-users@lists.schedmd.com/msg0

[slurm-users] Kill job when child process gets OOM-killed

2021-06-08 Thread Arthur Gilly
Dear Slurm users, I am looking for a SLURM setting that will kill a job immediately when any subprocess of that job hits an OOM limit. Several posts have touched upon that, e.g: https://www.mail-archive.com/slurm-users@l