Re: [slurm-users] detectCores() mess

2017-12-08 Thread Chris Samuel
On 9/12/17 4:54 am, Mike Cammilleri wrote: I thought cgroups (which we are using) would prevent some of this behavior on the nodes (we are constraining CPU and RAM) - I'd like there to be no I/O wait times if possible. I would like it if either Linux or Slurm could constrain a job from grabbing m

[slurm-users] Invoke squeue sort on submit time?

2017-12-08 Thread E.M. Dragowsky
Greetings -- According to the documentation, I ought to be able to apply the format options as arguments for the sort flag of squeue. It turns out (as seems consistent with the docs slurm.schedmd.com/squeue.html) that the flag for submit time (-o "%V") cannot be used as an argument to sort squeue

Re: [slurm-users] detectCores() mess

2017-12-08 Thread Jeff White
On 12/08/2017 09:54 AM, Mike Cammilleri wrote: Hi, We have allowed some courses to use our Slurm cluster for teaching purposes, which of course leads to all kinds of exciting experiments - not always the most clever programming but it certainly teaches me where we need to tighten up configuratio

Re: [slurm-users] Strange problem with Slurm 17.11.0: "batch job complete failure"

2017-12-08 Thread Andy Riebs
Answering my own question, I got private email which points to , describing both the problem and the solution. (Thanks Matthieu!) Andy On 12/08/2017 11:06 AM, Andy Riebs wrote: I've gathered more information, and I am probably having a fight wi

[slurm-users] detectCores() mess

2017-12-08 Thread Mike Cammilleri
Hi, We have allowed some courses to use our Slurm cluster for teaching purposes, which of course leads to all kinds of exciting experiments - not always the most clever programming but it certainly teaches me where we need to tighten up configurations. The default method of thinking for many stud

Re: [slurm-users] Strange problem with Slurm 17.11.0: "batch job complete failure"

2017-12-08 Thread Andy Riebs
I've gathered more information, and I am probably having a fight with PAM. First, of note, this problem can be reproduced with a single-node, single-task job, such as $ sbatch -N1 --reservation awr #!/bin/bash hostname Submitted batch job 90436 $ sinfo -R batch job complete f slurm 2017-12

Re: [slurm-users] Slurm fair share priority not being applied

2017-12-08 Thread Bruno Santos
Hi Lachlan, The problem turned out to be that the MySQL-backed slurmdbd did not have the users set up. Once I added the user with sacctmgr and restarted the daemons, everything is working. Or at least until I find the next problem. Best, Bruno On 8 December 2017 at 00:51, Lachlan Musicman wrote: > On 1 December