Re: [slurm-users] slurm-users Digest, Vol 66, Issue 6

2023-04-05 Thread Robert Barton
call right now but the OOM killer might be at work. Run dmesg -T | grep oom to see if the OS is killing jobs to recover memory.
Doug

On Mon, Apr 3, 2023, 8:56 AM Robert Barton wrote:
Hello, I'm looking for help in understanding a problem we're having such that Slurm indicates that a job was k
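
The check Doug suggests can also be run over a saved copy of the kernel log. What follows is only a rough sketch, not anything from the thread: the function name find_oom_lines and the sample log lines are made up for illustration, and the pattern is widened to catch "Out of memory" as well, which a plain case-sensitive grep oom would miss.

```python
import re

# Case-insensitive so that both "oom-killer" and "Out of memory" are caught.
# Like a plain grep, this also matches unrelated words containing "oom".
OOM_PATTERN = re.compile(r"oom|out of memory", re.IGNORECASE)


def find_oom_lines(log_text):
    """Return the lines of a kernel log dump that mention the OOM killer."""
    return [line for line in log_text.splitlines() if OOM_PATTERN.search(line)]


# Hypothetical sample, invented for illustration; not output from the
# poster's nodes.
SAMPLE_LOG = """\
[Mon Apr  3 08:41:10 2023] usb 1-1: new high-speed USB device number 2
[Mon Apr  3 08:41:12 2023] python invoked oom-killer: gfp_mask=0x0, order=0
[Mon Apr  3 08:41:12 2023] Out of memory: Killed process 4242 (python)
"""

if __name__ == "__main__":
    for line in find_oom_lines(SAMPLE_LOG):
        print(line)
```

To use it on a real node, the output of dmesg -T could be saved to a file and its contents passed to find_oom_lines; matching timestamps against the job's end time would then show whether the kernel, rather than Slurm, killed the process.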

[slurm-users] Job killed for unknown reason

2023-04-03 Thread Robert Barton
Hello, I'm looking for help in understanding a problem we're having such that Slurm indicates that a job was killed, but not why. It's not clear what's actually killing the jobs; we've seen jobs killed for time limits and out-of-memory issues, and those reasons are obvious in the logs when th