Re: [slurm-users] bug 2119 with slurm 18.08.2

2018-11-11 Thread Magnus Jonsson
We got the same problem on our clusters. It was due to our backup script of mysql was locking the tables (and taking to long time). If looking at ''mod_time'' and ''control_host'' of ''cluster_table'' in the database: select mod_time,control_host from cluster_table; We found that ''mod_time''

Re: [slurm-users] constraints question

2018-11-11 Thread Christopher Samuel
Hi Doug, On 12/11/18 8:34 am, Douglas Jacobsen wrote: I think you'll need to update to 18.08 to get this working, constraint arithmetic and knl were not compatible until that release. Thanks! That's planned for us today (though we're not using constraints) and from the sound of it Tina should

Re: [slurm-users] constraints question

2018-11-11 Thread Douglas Jacobsen
I think you'll need to update to 18.08 to get this working, constraint arithmetic and knl were not compatible until that release. Doug Jacobsen, Ph.D. NERSC Computer Systems Engineer Acting Group Lead, Computational Systems Group National Energy Research Scientific Computing Center

[slurm-users] Slurm User Group 2018 presentations online, SC18

2018-11-11 Thread Tim Wickberg
Many thanks to all the attendees, and especially to all those who presented at the Slurm User Group 2018 meeting in Madrid. Thank you to CIEMAT as well for hosting, and I hope to see many of you at SLUG'19 at the University of Utah in Salt Lake City. PDFs of the presentations are online at htt

Re: [slurm-users] Reserving a GPU

2018-11-11 Thread Chris Samuel
On Tuesday, 6 November 2018 5:30:31 AM AEDT Christopher Benjamin Coffey wrote: > Can anyone else confirm that it is not possible to reserve a GPU? Seems a > bit strange. This looks like the bug that was referred to previously. https://bugs.schedmd.com/show_bug.cgi?id=5771 Although looking at th

Re: [slurm-users] constraints question

2018-11-11 Thread Chris Samuel
On Tuesday, 6 November 2018 11:06:43 PM AEDT Tina Friedrich wrote: > So what am I doing wrong with the 'or'? I don't have node features defined (other than for KNL nodes), so I can't test your scenario, but I do see similar as I get the error: $ srun -C "broadwell|haswell" --pty /bin/bash srun: