Re: [slurm-users] DBD_SEND_MULT_MSG - invalid uid error

2024-01-08 Thread Craig Stark
This ticket with SchedMD implies it's a munged issue: https://bugs.schedmd.com/show_bug.cgi?id=1293 Is the munge daemon running on all sys

Re: [slurm-users] DBD_SEND_MULT_MSG - invalid uid error

2024-01-08 Thread Timony, Mick
This ticket with SchedMD implies it's a munged issue: https://bugs.schedmd.com/show_bug.cgi?id=1293 Is the munge daemon running on all systems? If it is, are all servers running a network time daemon such as chronyd or ntpd, and is the time in sync on all hosts? Regards --Mick _
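
The time-sync question above can be checked roughly once a Unix timestamp has been collected from each host. The following is a minimal sketch, not part of the original message: the function name, the host names, and the 60-second threshold are all hypothetical, and how the timestamps are gathered (for example by running `date +%s` on each host) is left as an assumption.

```python
import statistics


def hosts_out_of_sync(timestamps, max_skew_seconds=60.0):
    """Return the hosts whose clock differs from the median by more than the threshold.

    timestamps: dict mapping host name -> Unix time reported by that host.
    max_skew_seconds: hypothetical tolerance; choose a value that suits your site.
    """
    reference = statistics.median(timestamps.values())
    return sorted(
        host
        for host, reported in timestamps.items()
        if abs(reported - reference) > max_skew_seconds
    )


# Hypothetical readings taken at (nearly) the same moment on three hosts.
readings = {"ctl": 1704700000, "dbd": 1704700002, "node01": 1704700400}
print(hosts_out_of_sync(readings))
```

Using the median as the reference keeps a single badly skewed host from shifting the baseline for the others.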

Re: [slurm-users] DBD_SEND_MULT_MSG - invalid uid error

2024-01-08 Thread Craig Stark
3rd time trying to get this to come through to the list - hopefully this time works. I've been running SLURM for several years now, but in setting it up on a new cluster, I'm hitting a recurring issue. I'm using a MariaDB and configured it just as I had in my several-year-ago setup and in the

Re: [slurm-users] Multifactor fair-share with single account

2024-01-08 Thread Kamil Wilczek
Thank you all for the help! I created a setup with a single account and multi-factor scheduling with three non-zero weights: job age, job size and fair-share. I'll monitor the fair-share once enough users have registered on the cluster. Kind regards, -- Kamil Wilczek [https://keys.openpgp.org/] [D4
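
As a rough illustration of how three non-zero weights combine, the multifactor priority plugin computes a job's priority as a weighted sum of factors, each in the range 0.0 to 1.0. The sketch below covers only the three factors named in the message; the function name and the weight values are hypothetical and are not taken from the poster's configuration.

```python
def job_priority(age_factor, job_size_factor, fairshare_factor,
                 weight_age, weight_job_size, weight_fairshare):
    """Weighted sum of the three priority factors (each expected in [0.0, 1.0]).

    Other terms of the full formula (partition, QOS, TRES, nice, site factor)
    are assumed to be zero here.
    """
    for factor in (age_factor, job_size_factor, fairshare_factor):
        if not 0.0 <= factor <= 1.0:
            raise ValueError("priority factors must lie between 0.0 and 1.0")
    return (weight_age * age_factor
            + weight_job_size * job_size_factor
            + weight_fairshare * fairshare_factor)


# Hypothetical weights: fair-share dominates, job size matters least.
print(job_priority(0.5, 0.2, 1.0,
                   weight_age=1000, weight_job_size=500, weight_fairshare=2000))
```

With a single account, the fair-share factor is the only term that distinguishes users from one another, so its weight relative to the other two determines how strongly past usage affects ordering.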

[slurm-users] Tool for profiling resource usage by slurm jobs

2024-01-08 Thread Nicolas Granger
Hi all, Happy new year everyone! I've been looking for a simple tool that reports how many resources are actually consumed by a job to help my colleagues and me adjust job requirements. I could not find such a tool, or the ones mentioned on this ML were not easy to install and use, so I have w