[slurm-users] Power saving is getting disabled in Slurm

2025-01-09 Thread Suman Deb via slurm-users
Hello, We are enabling global power saving for slurm. This is our config. But we are experiencing an issue where Slurm is talking nodes out of power saving mode. We are using config less and dynamic nodes. Slurmctrld logs 2025-01-08T09:54:18.096] Cleared POWER_SAVE flag from nodes dev3-cbf-debu

Re: [slurm-users] Power saving and node weight

2023-03-01 Thread Brian Andrus
Gizo, There is no documentation, only the bug report mentioned. We may be able to help from the list if you explain what you want to do. Brian On 3/1/2023 12:29 AM, Gizo Nanava wrote: Hello Brian, thanks a lot for the info. You may be able to use the alternate approach that I was able to d

Re: [slurm-users] Power saving and node weight

2023-03-01 Thread Gizo Nanava
Hello Brian, thanks a lot for the info. > > You may be able to use the alternate approach that I was able to do as well. > I would be insterested in any alternatives. Could you point me to some doc? Best wishes Gizo > Brian Andrus > > > On 2/28/2023 7:44 AM, Gizo Nanava wrote: > > Hello, >

Re: [slurm-users] Power saving and node weight

2023-02-28 Thread Brian Andrus
Gizo, I had that issue and opened a ticket. It is not considered a bug but a feature request. They have no plans to address it at this time. 9734 – Jobs sent to higher weight idle node instead of starting lower weight node (schedmd.com) You ma

[slurm-users] Power saving and node weight

2023-02-28 Thread Gizo Nanava
Hello, it seems that if a slurm power saving is enabled then the parameter "Weight" seem to be ignored for nodes that are in a power down state. Is there any way to make the option working for a cluster running slurm in a powe saveing mode?. I am aware of the note to the weight option in the

[slurm-users] Power saving method selection for different kinds of hardware

2022-11-08 Thread Ole Holm Nielsen
I'm thinking about the best way to configure power saving (see https://slurm.schedmd.com/power_save.html) when we have different types of node hardware whose power state have to be managed differently: 1. Nodes with a BMC NIC interface where "ipmitool chassis power ..." commands can be used.

Re: [slurm-users] Power saving

2022-07-28 Thread Benson Muite
On 7/28/22 18:49, Djamil Lakhdar-Hamina wrote: I am helping set up a 16 node cluster computing system, I am not a system-admin but I work for a small firm and unfortunately have to pick up needed skills fast in things I have little experience in. I am running Rocky Linux 8 on Intel Xeon Knights

[slurm-users] Power Saving Issue - Job B is executed before Job A - node not ready?

2020-12-23 Thread Eg. Bo.
Hello, Slurm Power Saving (19.05.) was configured successfuly within our Cloud environment. Jobs can be submitted and nodes get provisioned and deprovisioned as expected. Unfortunately there seems to be an edge case (or config issue :-D).After a job (jobA) is submitted to partition A, node provi