Loris Bennett writes:
> Hi,
>
> With version 16.05.10-2, the option '--units' get applied incorrectly to
> the column 'NNodes':
>
> $ sacct -u user1234 -o jobid,nnodes,ncpus,reqmem,maxrss,elapsed -S 2017-07-01
> --units=G
>JobID NNodes NCPUS ReqMem MaxRSSElapsed
> ---
Hoping someone will get eyes on this one. I ended up changing the partition
in question to only use 1 thread per core to keep things simple, but it
would still be nice to know why slurm is looking at TRES hours instead of
RawUsage.
thanks.
-John
On Wed, Nov 15, 2017 at 10:55 AM, John Roberts
wro
Hello when i run job with srun and if my node reboot i have this error after 1
minute :
srun: error: Timed out waiting for job step to complete
it's possible to change timeout to 3600seconds for example ? to force slurmctld
to wait that my node reboot ?
i try
srun --wait 3600
or i add in m