[slurm-users] Gres GPU Resource Issue

2020-05-15 Thread Speer, Andrew
I've run into a bit of an issue when trying to define GPU's in our slurm conf. Any insight is appreciated. Hopefully relevant lines from the configs below. Error: [2020-05-15T16:35:14.862] error: gres_plugin_node_config_unpack: No plugin configured to process GRES data from node node3 (Name:gpu

Re: [slurm-users] Node suspend / Power saving - for *idle* nodes only?

2020-05-15 Thread Steven Dick
I've had slurm power off a few nodes I was working on... My normal solution is to just power them back on without slurm's help. Then it brings the node up in state "down / unexpectedly booted" and it doesn't seem to mess with them until I use scontrol to change the state again. (I like scontrol re

Re: [slurm-users] Node suspend / Power saving - for *idle* nodes only?

2020-05-15 Thread Riebs, Andy
And if you're willing to buy a support contract with SchedMD, and/or provide a fix, it will be fixed. Otherwise, you'll have to accept that you've got a large group of users, just like you, who are willing to share their expertise and experience, even if it's not our "day job" -- or even our "ni

Re: [slurm-users] [External] Re: Node suspend / Power saving - for *idle* nodes only?

2020-05-15 Thread Florian Zillner
FWIW this is a known bug: https://bugs.schedmd.com/show_bug.cgi?id=5348 5348 – Suspending Nodes which are not in IDLE mode SchedMD - Slurm development and support. Providing support for some of the largest clusters in the world. bugs.schedmd.com __