Every month I run sacct to see who's been running jobs.
sacct -v --start=2021-02-01 --end=2021-03-01 -o user
runs and returns just fine.
any end date pass 2021-03-01 seems to hang, eg:
sacct -v --start=2021-02-01 --end=2021-03-02 -o user
never returns.
Thoughts?
Running slurm-ohpc-18.08.8-4.
Hmm.. I will have to investigate pam_slurm_adopt .
David William Botsch
Programmer/Analyst
@CNFComputing
bot...@cnf.cornell.edu
On October 17, 2018 12:03:02 AM Chris Samuel wrote:
On Wednesday, 17 October 2018 12:04:05 AM AEDT
ot to stray too far
from the defaults to make upgrades easier.
Thanks.
On Tue, Oct 16, 2018 at 01:41:15PM -0600, Michael Jennings wrote:
> On Tuesday, 16 October 2018, at 09:30:13 (-0400),
> Dave Botsch wrote:
>
> > Hrm... it looks like the default install of OHPC went with
n Mon, Oct 15, 2018 at 05:51:10PM -0400, Dave Botsch wrote:
>
>
> Wanted to test X11 forwarding. X11 forwarding works as a normal user
> just ssh'ing to a node and running xterm/etc.
>
> With srun, however:
>
> srun -n1 --pty --x11 xterm
> srun: error: Unable to a
(I think same for DSA/RSA keys etc).
>
> Tina
>
> On Tuesday, 16 October 2018 09:31:17 BST Dave Botsch wrote:
> > That's not the issue, here (though I have experienced that before).
> > Regular ssh forwarding works fine.
> >
> > On Tue, Oct 16, 2018 at 0
Sadly that did not make a difference.
On Mon, Oct 15, 2018 at 09:31:26PM -0400, R. Paul Wiegand wrote:
> I believe you also need:
>
> X11UseLocalhost no
>
>
>
> > On Oct 15, 2018, at 7:07 PM, Dave Botsch wrote:
> >
> > Hi.
> >
> > X1
with that; setting the hostnames to the
> > short hostname made it all magically work.
> >
> > Tina
> >
> > On Tuesday, 16 October 2018 09:29:01 BST Olivier Sallou wrote:
> >> On 10/16/2018 01:07 AM, Dave Botsch wrote:
> >>> Hi.
> >>>
&g
e
> instructions. At the moment, I don't have access to the server.
>
> Regards,
> Mahmood
>
>
>
> Sent from Gmail on Android
>
>
>
>
> On Tue, Oct 16, 2018, 05:03 R. Paul Wiegand wrote:
>
> > I believe you also need:
> >
> >
16 October 2018 09:29:01 BST Olivier Sallou wrote:
> > On 10/16/2018 01:07 AM, Dave Botsch wrote:
> > > Hi.
> > >
> > > X11 forwarding is enabled and works for normal ssh.
> >
> > I faced same issue, with ssh x11 working as expected on compute no
n't work with that; setting the hostnames to the
> >> short hostname made it all magically work.
> >>
> >> Tina
> >>
> >> On Tuesday, 16 October 2018 09:29:01 BST Olivier Sallou wrote:
> >>> On 10/16/2018 01:07 AM, Dave Botsch wrote:
&g
561.297.2647
>
> Fax 561.297.0222
>
> [image] <https://hpc.fau.edu/wp-content/uploads/2015/01/image.jpg>
>
>
>
> From: slurm-users on behalf of Dave
> Botsch
> Sent: Monday, October 15, 2018 5:51 PM
> To: slurm-user
Wanted to test X11 forwarding. X11 forwarding works as a normal user
just ssh'ing to a node and running xterm/etc.
With srun, however:
srun -n1 --pty --x11 xterm
srun: error: Unable to allocate resources: X11 forwarding not available
So, what am I missing?
Thanks.
PS
srun --version
slurm 1
the process because it may need to pick a much earlier start time in the
> past to summarize.
>
> Sacctmgr show runawayjobs can help identify if you are in this situation
>
> On Sun, Oct 14, 2018 at 2:05 PM Dave Botsch wrote:
>
> > This seems to reflect what I am seeing. S
d only be initiated one way. However, restarting slurmdbd would restart
> the connection and resync the latest state (or something like that, it was a
> few years ago)
>
> > On 14 Oct 2018, at 21:49, Dave Botsch wrote:
> >
> > This seems to reflect what I am seeing. So
This seems to reflect what I am seeing. Someone earlier mentioned
multiple restarts of slurmdbd... those restarts never made data appear
unless right around on the hour.
It's as if instead of data getting sent right through slurmdbd that
something in slurmdbd is just doing an hourly check of the t
Hi.
I am setting up a new slurm cluster instance. And I just went through
what I thought were the right steps to get job accounting going with
slurmdbd.
So I know that slurmdbd itself works as I can use the sacctmgr commands
to add users and accounts, and the users cannot run jobs unless I first
16 matches
Mail list logo