Re: [slurm-users] Analyzing a stuck job
On 2/14/19 8:02 AM, Mahmood Naderan wrote: One job is in RH state which means JobHoldMaxRequeue. The output file, specified by --output shows nothing suspicious. Is there any way to analyze the stuck job? This happens when a job fails to start for MAX_BATCH_REQUEUE times (which is 5 at the mo
[slurm-users] Analyzing a stuck job
Hi, One job is in RH state which means JobHoldMaxRequeue. The output file, specified by --output shows nothing suspicious. Is there any way to analyze the stuck job? Regards, Mahmood