call right now but oom might be working. dmesg -T | grep oom to see if
the OS is wiping out jobs to recover memory.
Doug
On Mon, Apr 3, 2023, 8:56 AM Robert Barton wrote:
Hello,
I'm looking for help in understanding a problem we're having such that
Slurm indicates that a job was k
Hello,
I'm looking for help in understanding a problem we're having such that
Slurm indicates that a job was killed, but not why. It's not clear
what's actually killing the jobs; we've seen jobs killed for time limits
and out-of-memory issues, and those reasons are obvious in the logs when
th