[slurm-users] How to partition nodes into smaller units

2019-02-05 Thread Ansgar Esztermann-Kirchner
Hello List, we're operating a large-ish cluster (about 900 nodes) with diverse hardware. It has been running with SGE for several years now, but the more we refine our configuration, the more we're feeling SGE's limitations. Therefore, we're considering switching to Slurm. The latest challenge i

Re: [slurm-users] Mysterious job terminations on Slurm 17.11.10

2019-02-05 Thread Andy Riebs
Excellent suggestions Chris, but they didn't pan out (and I had such high hopes for them!). As for slurm.conf, here's the output from "scontrol show config" (I've included the original problem report below): Configuration data as of 2019-02-05T14:29:11 AccountingStorageBackupHost = (null) Acc

Re: [slurm-users] Error in job_submit.lua conditional?

2019-02-05 Thread mercan
Hi; I think dirty debugging is required using printf (slurm.log_user), because the lua of our slurm installation returns a lot of variables as nil. You can limit the output to a specific user as below: if job_desc.user_name == "mercan" then     slurm.log_user("job_desc.user_id=")     slurm.l

Re: [slurm-users] Error in job_submit.lua conditional?

2019-02-05 Thread Marcus Wagner
Hmm..., no, I was wrong. IT IS 'user_id'. Now I'm a bit dazzled Marcus On 2/4/19 11:27 PM, Prentice Bisbal wrote: Can anyone see an error in this conditional in my job_submit.lua?     if ( job_desc.user_id == 28922 or job_desc.user_id == 41266 ) and ( job_desc.partition == 'general' or job

Re: [slurm-users] Error in job_submit.lua conditional?

2019-02-05 Thread Marcus Wagner
Hi Prentice, I also hate lua sometimes, as it does not complain, when you hope it would complain. It is called 'userid', not 'user_id', so the first part is all the time false ;) Best Marcus On 2/4/19 11:27 PM, Prentice Bisbal wrote: Can anyone see an error in this conditional in my jo