Re: [slurm-users] Slurm - UnkillableStepProgram

2021-03-22 Thread Chris Samuel
Hi Mike, On 22/3/21 7:12 pm, Yap, Mike wrote: # I presume UnkillableStepTimeout is set in slurm.conf. and it act as a timer to trigger UnkillableStepProgram That is correct. # UnkillableStepProgram   can be use to send email or reboot compute node – question is how do we configure it ? Al

[slurm-users] Slurm - UnkillableStepProgram

2021-03-22 Thread Yap, Mike
Hi All Have been reading on the archive hoping to implement unkillablesteptimeout and unkillablesteprogram to the slurm But I'm kind of confuse with it application 1. I presume UnkillableStepTimeout is set in slurm.conf. and it act as a timer to trigger UnkillableStepProgram 2. Unkillab

[slurm-users] Slurm prolog export variable

2021-03-22 Thread Yap, Mike
Hi All Can anyone assist the following We're using Bright Cluster 9.1 with CentOS 7.9 running with slurm 2.02.6 We have a script running on prolog exporting the SCRATCH as variable for user running job Addition command on the script to create a user folder accordingly When submitting the job, t

Re: [slurm-users] [External] srun at front-end nodes with --enable_configless fails with "Can't find an address, check slurm.conf"

2021-03-22 Thread Matthew BETTINGER
Also check the settings on your nodeaddr in slurm.conf On 3/22/21, 2:48 PM, "slurm-users on behalf of Michael Robbert" wrote: I haven't tried configless setup yet, but the problem you're hitting looks like it could be a DNS issue. Can you do a dns lookup of n26 from the login node? The w

Re: [slurm-users] [External] srun at front-end nodes with --enable_configless fails with "Can't find an address, check slurm.conf"

2021-03-22 Thread Michael Robbert
I haven't tried configless setup yet, but the problem you're hitting looks like it could be a DNS issue. Can you do a dns lookup of n26 from the login node? The way that non-interactive batch jobs are started may not require that, but I believe that it is required for interactive jobs. Mike Ro

[slurm-users] srun at front-end nodes with --enable_configless fails with "Can't find an address, check slurm.conf"

2021-03-22 Thread Josef Dvoracek
Hi @list; I was able to configure "configless" slurm cluster with quite minimalistic slurm.conf everywhere, of-course excepting slurmctld server. All nodes are running slurmd, including front-end/login nodes to pull the config. Submitting jobs using sbatch scripts works fine, but interactive

Re: [slurm-users] Set Fairshare by Hand

2021-03-22 Thread Luke Yeager
I asked something similar a few months ago and wasn't able to find anything to suit my needs. https://groups.google.com/g/slurm-users/c/ude1M5w_4IU/m/R2GziD9JAQAJ Good luck! Luke -Original Message- From: slurm-users On Behalf Of Paul Edmon Sent: Monday, March 22, 2021 6:31 AM To: slurm

Re: [slurm-users] Slurm version 20.11.5 is now available

2021-03-22 Thread Brian Andrus
I create the link at job runtime. For my case, I use the prologue to validate the link since I know ahead of time what is needed based on job templates that are used. In the more general case, I would have it be part of the batch script the user runs. Just ensure they clean up afterwards (I u

Re: [slurm-users] Set Fairshare by Hand

2021-03-22 Thread Paul Edmon
No, there is no way to my knowledge to do this.  You can zero out some one's fairshare (by removing and readding them) or a groups fairshare but you can't set it to an arbitrary value. You can always adjust their RawShares for a somewhat similar effect but that will have all the normal consequ

[slurm-users] Set Fairshare by Hand

2021-03-22 Thread Michael Müller
Dear Slurm users and admins, can we set the faireshare values manually, i.e., they are not (re)calculated be Slurm? With kind regards Michael -- Michael Müller Application Developer Department of System and Service Design (SDE) Center of Information Services and High Performance Computing (ZIH