James, you might take a look at CompleteWait and KillWait.
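
For reference, here is a minimal slurm.conf sketch of those two parameters. The values are assumptions chosen only for illustration (CompleteWait a little longer than the 2-3 minute epilog you describe, KillWait at what I believe is its default), so please check them against the slurm.conf man page for 19.05 before using them:

```
# slurm.conf (excerpt) -- illustrative values, not a tested recommendation

# Seconds to wait, while any job is in the COMPLETING state, before
# scheduling additional jobs. Assumed value: slightly longer than a
# 2-3 minute epilog.
CompleteWait=240

# Seconds between SIGTERM and SIGKILL when a job reaches its time limit.
# Assumed value: 30.
KillWait=30
```

As I understand it, CompleteWait holds off scheduling additional jobs while any job is in the COMPLETING state, so a large value can delay scheduling cluster-wide, not just on the node running the epilog.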

Regards,
Lyn

On Fri, Jan 3, 2020 at 12:27 PM Erwin, James <james.er...@intel.com> wrote:

> Hello,
>
> I’ve recently updated a cluster to SLURM 19.05.4 and have noticed that new
> jobs are starting on nodes still in the CG state. My epilog runs node
> health checks that take about 2-3 minutes. In the previous version (the
> ancient 15.08), jobs would not start running on these nodes until the
> epilog was complete and the node was out of the CG state. Does anyone know
> why this overlap of R with CG might be happening?
>
> There is a release note for version 19.05.3 that looks possibly related
> but I’m not exactly sure what it means:
>
> * Changes in Slurm 19.05.3
> ==========================
> ...
>  -- Nodes in COMPLETING state treated as being currently available for job
>     will-run test.
>
> Thanks,
> James
