Hi all,
First off, thank you all for all of your quick replies and suggestions on how
to solve this problem of mine.
With some additional help from Tina Friedrich I did a test proposed by her:
First step: ”ssh -X ”.
Second step: ”ssh -X localhost”.
Third step: ”srun —x11 ”
(It should read a d
All,
Wanted to reach out for input on how folks compile slurm when you have a
hybrid cluster.
Scenario:
you have 4 node types:
A) CPU only
B) GPU Only
C) CPU+IB
D) GPU+IB
So, you can compile slurm with/without IB support and/or with/without
GPU support.
Including either option creates a d
Hi Ole,
Thanks a lot for sharing the resource!
Our biggest concern is the case where a user asks for 2 nodes and one of
those is a Cascade Lake node and the other one is a Haswell node. At this
point, the environment has modules that work on both, hence my preference
for reading the slurm nodeli
On 28-02-2020 19:44, Sajid Ali wrote:
If I install multiple versions of a software library, each optimized for
a different partition based on CPU architecture, how would I
automatically load the version of software based on the nodes allocated
to the job ?
Ideally I'd want to store the module
Hi Slurm-developers/users,
If I install multiple versions of a software library, each optimized for a
different partition based on CPU architecture, how would I automatically
load the version of software based on the nodes allocated to the job ?
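One possible approach, sketched here under assumptions: tag each node with a CPU-architecture feature, look up the features of the nodes allocated to the job, and load the build that matches the oldest architecture present, so the binaries run on every node of a mixed allocation. The feature names, their ordering and the helper `pick_common_arch` are hypothetical and not taken from this thread.

```python
# Hypothetical feature names, ordered oldest to newest; adjust to the site's own.
ARCH_ORDER = ["haswell", "broadwell", "skylake", "cascadelake"]


def pick_common_arch(node_features, arch_order=ARCH_ORDER):
    """Return the oldest architecture found among the allocated nodes.

    node_features maps a node name to its list of feature strings
    (assumed to mirror the node's Features= setting in slurm.conf).
    """
    if not node_features:
        raise ValueError("no nodes given")
    ranks = []
    for node, features in node_features.items():
        known = [arch_order.index(f) for f in features if f in arch_order]
        if not known:
            raise ValueError(f"no known architecture feature on node {node}")
        # A node should carry one architecture feature; if it carries
        # several, take the newest one it claims.
        ranks.append(max(known))
    # The oldest architecture across all nodes is the common denominator.
    return arch_order[min(ranks)]
```

Inside a job script the result could pick the module directory or version suffix to load. How the features are queried for each allocated node is left out of this sketch.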
Ideally I'd want to store the modules for each cpu
I need your help.
We have had a request to generate a report showing the number of jobs by date
showing pre-empted jobs. We used sacct to try to gather the data but we only
found a few jobs with the state "PREEMPTED".
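For the per-date report, a minimal sketch of the counting step follows. It assumes the input is pipe-separated text as produced by something like `sacct -a -X -n -P --format=JobID,State,End`; the function name `preempted_per_day` is made up for illustration. It only counts what the accounting database recorded as PREEMPTED, so it does not by itself explain any gap against the slurmd logs.

```python
from collections import Counter


def preempted_per_day(sacct_text):
    """Count jobs in state PREEMPTED, grouped by the date part of End.

    Expects one job per line in the form 'JobID|State|End', where End
    looks like 2020-02-27T10:15:00 (an assumption about the sacct output).
    """
    counts = Counter()
    for line in sacct_text.splitlines():
        fields = line.strip().split("|")
        if len(fields) != 3:
            continue  # skip blank or malformed lines
        _jobid, state, end = fields
        if state.startswith("PREEMPTED"):
            counts[end[:10]] += 1  # keep only YYYY-MM-DD
    return dict(counts)
```

The returned dictionary maps each date to the number of pre-empted jobs that ended on it.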
Scanning the slurmd logs we find there are a lot of jobs that show pre-empte
Hi Marcus,
You are correct, but it doesn't show anything regarding the X11-forwarding.
Thank you for your input!
Best regards,
Pär Lundö
From: "slurm-users"
Sent: 28 feb. 2020 15:57
To: "slurm-users@lists.schedmd.com"
Subject: Re: [slurm-users] Slurm 19.05 X11-f
Thanks, that was very useful. The key takeaways for me are:
Set “PrivateData=cloud”. Documentation states that the default is that
everything’s public and that those options make things private. Apparently
except for this case which allows regular users to see nodes that are powered
down.
Set “
Hi Pär,
Yes, you can use -v or e.g. -vvv together with srun. I'm not sure, though,
if that shows anything X11-related, but you might try.
Best
Marcus
On 2/28/20 3:45 PM, Pär Lundö wrote:
Hi everyone
Thank you for your support.
I’ve done a few changes and done some further testing but it has
Hi everyone
Thank you for your support.
I’ve done a few changes and done some further testing but it has not solved my
problem.
Regardless of the settings for the sshd I can’t get it to
I am able to use SSH to the node directly along with ”-X”-argument and get the
X11 forwarding to work.
Is the
Hello,
on an old machine running CentOS 6.10, I've installed Slurm 18.08.3 from
sources, and tried to set up a simple configuration (attached
slurm.conf).
After starting slurmctld and slurmd, sinfo shows everything okay, but at
the first submission with sbatch, I got errors and the node becomes "drain
When I made similar queues, and only wanted my GPU jobs to use up to 8 cores
per GPU, I set Cores=0-7 and 8-15 for each of the two GPU devices in gres.conf.
Have you tried reducing those values to Cores=0 and Cores=20?
> On Feb 27, 2020, at 9:51 PM, Pavel Vashchenkov wrote:
>
> External Email
I'm running clusters entirely in Google Cloud. I'm not sure I'm
understanding the issue--do the nodes disappear from view entirely only
when they fail to power up by ResumeTimeout? Failures of this kind are
happening in GCE when resources are momentarily unavailable, but the nodes
are still there,
Ole Holm Nielsen writes:
> You may use the (undocumented) format=... option to select only the
A while ago, after meticulous study of the man page, I discovered that
the format option is not actually undocumented, it is just very well
hidden. :) All that "man sacctmgr" says about it is
GLOBAL F
sacctmgr show association
You may use the (undocumented) format=... option to select only the
columns you want, for example:
sacctmgr show assoc format=user,account,qos
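To turn that listing into a user-to-account table, something along these lines might work. It is only a sketch: it assumes the text comes from `sacctmgr -n -P show assoc format=user,account` (no header, pipe-separated), and the helper name `accounts_by_user` is made up for illustration.

```python
from collections import defaultdict


def accounts_by_user(sacctmgr_text):
    """Map each user to the sorted list of accounts they are associated with.

    Expects one association per line in the form 'user|account'.
    Lines with an empty user field (account-level associations) are skipped.
    """
    mapping = defaultdict(set)
    for line in sacctmgr_text.splitlines():
        fields = line.strip().split("|")
        if len(fields) < 2:
            continue  # skip blank or malformed lines
        user, account = fields[0], fields[1]
        if user:
            mapping[user].add(account)
    return {user: sorted(accounts) for user, accounts in mapping.items()}
```
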
Usage of the format option is only given in the Examples section of the
sacctmgr page https://slurm.schedmd.com/sacctmgr.ht
Hi,
You're looking for 'associations' between users, accounts and their limits.
Try `sacctmgr show assoc [tree]`
Best,
Marcus
On 20-02-28 09:38, Matthias Krawutschke wrote:
> Dear Slurm-User,
>
>
>
> I have a simple question about User and Account Management on SLURM.
>
>
>
> How can I f
Dear Slurm-User,
I have a simple question about User and Account Management on SLURM.
How can I find or print out which user is associated with which account?
I can list accounts and users, but not in combination. I have not found this in
the documentation.
Best regards
.
Ma