Re: [slurm-users] Analyzing a stuck job

2019-02-14 Thread Christopher Samuel
On 2/14/19 8:02 AM, Mahmood Naderan wrote: One job is in RH state which means JobHoldMaxRequeue. The output file, specified by --output shows nothing suspicious. Is there any way to analyze the stuck job? This happens when a job fails to start for MAX_BATCH_REQUEUE times (which is 5 at the mo

[slurm-users] Analyzing a stuck job

2019-02-14 Thread Mahmood Naderan
Hi, One job is in RH state which means JobHoldMaxRequeue. The output file, specified by --output shows nothing suspicious. Is there any way to analyze the stuck job? Regards, Mahmood