Re: [slurm-users] seff: incorrect memory usage (18.08.5-2)

2019-03-04 Thread Christopher Samuel
On 2/26/19 5:49 AM, Marcus Wagner wrote: If I remember right, there was a discussion lately in this list regarding the JobAcctGatherType, yet I do not remember the outcame It used to be that SchedMD would strongly recommend the non-group way of gathering information, but that never really wor

[slurm-users] Slurm message aggregation

2019-03-04 Thread Christopher Samuel
Hi folks, Anyone here tried Slurm's message aggregation (MsgAggregationParams in slurm.conf) at all? All the best, Chris -- Chris Samuel : http://www.csamuel.org/ : Berkeley, CA, USA

Re: [slurm-users] Migrate the slurmdbd service to another server

2019-03-04 Thread Ole Holm Nielsen
On 04-03-2019 16:30, Loris Bennett wrote: On 3/4/19 2:26 PM, Loris Bennett wrote: Ole Holm Nielsen writes: We're one of the many Slurm sites which run the slurmdbd database daemon on the same server as the slurmctld daemon. This works without problems at our site given our modest load, howeve

Re: [slurm-users] seff: incorrect memory usage (18.08.5-2)

2019-03-04 Thread Christopher Benjamin Coffey
You are welcome Loris! — Christopher Coffey High-Performance Computing Northern Arizona University 928-523-1167 On 2/26/19, 8:16 AM, "slurm-users on behalf of Loris Bennett" wrote: Hi Chris, I had JobAcctGatherType=jobacct_gather/linux TaskPlugin=task/affinity

Re: [slurm-users] Priority access for a group of users

2019-03-04 Thread david baker
Hello, Thank you for reminding me about the sbatch "--requeue" option. When I submit test jobs using this option the preemption and subsequent restart of a job works as expected. I've also played around with "preemptmode=suspend" and that also works, however I suspect we won't use that on these "d

Re: [slurm-users] Migrate the slurmdbd service to another server

2019-03-04 Thread Loris Bennett
Hi Ole, Ole Holm Nielsen writes: > On 3/4/19 2:26 PM, Loris Bennett wrote: >> Ole Holm Nielsen writes: >>> We're one of the many Slurm sites which run the slurmdbd database daemon on >>> the >>> same server as the slurmctld daemon. This works without problems at our >>> site >>> given our mo

Re: [slurm-users] Migrate the slurmdbd service to another server

2019-03-04 Thread Ole Holm Nielsen
On 3/4/19 2:26 PM, Loris Bennett wrote: Ole Holm Nielsen writes: We're one of the many Slurm sites which run the slurmdbd database daemon on the same server as the slurmctld daemon. This works without problems at our site given our modest load, however, SchedMD recommends to run the daemons on

Re: [slurm-users] Migrate the slurmdbd service to another server

2019-03-04 Thread Loris Bennett
Hi Ole, Ole Holm Nielsen writes: > We're one of the many Slurm sites which run the slurmdbd database daemon on > the > same server as the slurmctld daemon. This works without problems at our site > given our modest load, however, SchedMD recommends to run the daemons on > separate servers. > >

Re: [slurm-users] Migrate the slurmdbd service to another server

2019-03-04 Thread Ole Holm Nielsen
On 3/4/19 1:27 PM, Paul Edmon wrote: That should work. It did work as expected :-) The upgrade though will have to wait until the dbd is actually on a different server. Yes, that's the whole point of first migrating slurmdbd to a different server! Upgrading the Slurm RPMs on the slurmdb

Re: [slurm-users] Migrate the slurmdbd service to another server

2019-03-04 Thread Paul Edmon
That should work.  The upgrade though will have to wait until the dbd is actually on a different server.  We run the ctld and dbd on the same machine here for the sake of performance.  Before the rpm reorg we used to upgrade only the dbd first and then the ctld but with the reorg I'm taking a d