--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research
hen executed).
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Com
At this point, I’d probably crank up the logging some and see what it’s saying
in slurmctld.log.
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr
*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MSB A555B, Newark
`'
On Nov 27, 2024, at 09:56, Kent L. Hanson via slurm-users
___
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MSB A555B, Newark
`'
--
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MSB A555B, Newark
The benefits are pretty limited if you don’t have the server upgraded anyway,
unless you’re just saying it’s easier to install a current client.
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski
quick response Ryan!
Are there any recommendations for bf_ options from
https://slurm.schedmd.com/sched_config.html that could help with this?
bf_continue? Decreasing bf_interval= to a value lower than 30?
On Tue, Jun 4, 2024 at 4:13 PM Ryan Novosielski
mailto:novos...@rutgers.edu>>
This is relatively true of my system as well, and I believe it’s that the
backfill schedule is slower than the main scheduler.
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
un. I suspect it
is doing something.
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Offi
One of the other states — down or fail, from memory — should cause it to
completely drop the job.
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr
, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MSB A555B, Newark
`'
On May 16, 2024, at
| Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MSB A555B, Newark
`'
On Feb 16, 2024, at 14:41, Jason Simms via slurm-users
wrote:
Hello all,
I'v
Ah, I see — no, it’s 24.08. That’s why I didn’t find any reference to it.
Carry on! :-D
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MSB A555B, Newark
`'
, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MSB A555B, Newark
`'
On J
This is basically always somebody filling up /tmp and /tmp residing on the same
filesystem as the actual SlurmdSpoolDirectory.
/tmp, without modifications, it’s almost certainly the wrong place for
temporary HPC files. Too large.
Sent from my iPhone
> On Dec 8, 2023, at 10:02, Xaver Stiensmeie
*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MSB A555B, Newark
`'
On Dec 7, 2023, at 15:09, Chip Seraphine
It primarily does other things, but you can interact with Slurm in Open
OnDemand.
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973
, 2023, at 5:34 PM, Ryan Novosielski wrote:
Looks like 24.08 to me, so s/introduced/introduces.
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr
Looks like 24.08 to me, so s/introduced/introduces.
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
What do you mean by management node, slurmctld? Or just a node with the client
software on it?
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr
The pam_slurm.so<http://pam_slurm.so> module has an impact on these values, if
you are using it.
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ Universit
What we say at our site is that you should use srun, if you don’t use srun, you
will see limited, if any, output on resource usage in the various places you
can see it (sacct, etc), and I learned recently that sattach won’t work either.
I find it’s also easier to make mistakes with resource use
.
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MSB
etc.
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Resea
You can get some information on that from sdiag, and there are tweaks you can
make to backfill scheduling that affect how quickly it will get to a job.
That doesn’t really answer your real question, but might help you when you are
looking into this.
Sent from my iPhone
On Sep 29, 2023, at 16:1
outs, it’s pretty uneventful. You
won’t have that long database upgrade period, since no database modifications
will be required, so it’s pretty much like upgrading anything else.
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State
, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MSB A555B, Newark
`'
On Sep 28, 2023, at
is what I’d ask for.
I assume that archiving, in general, would also remove this stuff, since old
jobs themselves will be removed?
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos
for.
I assume that archiving, in general, would also remove this stuff, since old
jobs themselves will be removed?
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
te these things.
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Ad
Or an airport hotel for the first night. Done that many times.
Sent from my iPhone
On Aug 19, 2023, at 13:53, Lloyd Brown wrote:
Something else to consider that I just thought of.
If you're arriving late on Sunday, and SLUG doesn't start until Tuesday, you
cound just get a hotel in SLC some
I didn’t know that one! Thank you.
Sent from my iPhone
On Aug 17, 2023, at 09:50, Alain O' Miniussi wrote:
Hi Sean,
A colleague pointed to me the following commands:
#scontrol show hostname x[1000,1009,1029-1031]
x1000
x1009
x1029
x1030
x1031
#scontrol show hostlist x[1000,1009,1029,1030,10
I tend not to let them login. It will get their attention, and prevent them
from just running their work on the login node when they discover they can’t
submit. But appreciate seeing the other options.
Sent from my iPhone
> On May 25, 2023, at 19:19, Markuske, William wrote:
>
> Hello,
>
>
a shell for salloc is a newer feature.
For your version, you should:
srun -n 1 -t 00:10:00 --mem=1G --pty bash
Brian Andrus
On 5/19/2023 8:24 AM, Ryan Novosielski wrote:
I’m not at a computer, and we run an older version of Slurm yet so I can’t say
with 100% confidence that his this has
I’m not at a computer, and we run an older version of Slurm yet so I can’t say
with 100% confidence that his this has changed and I can’t be too specific, but
I know that this is the behavior you should expect from that command. I believe
that there are configuration options to make it behave di
I think it’s easier than all of this. Are you actually changing names of all of
these things, or just IP addresses? It they all resolve to an IP now and you
can bring everything down and change the hosts files or DNS, it seems to me
that if the names aren’t changing, that’s that. I know that “sc
/pestat/pestat
--
#BlackLivesMatter
|| \\UTGERS, |-------*O*---
||_// the State | Ryan Novosielski -
novos...@rutgers.edu<mailto:novos...@rutgers.edu>
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MSB C630, Newark
`'
d the graphs look awesome!
> Would you be willing to share the scripts you're using to generate
> those reports? That sounds like something many sites could benefit
> from!
Agreed, same.
--
#BlackLivesMatter
|| \\UTGERS, |---*O*-
We basically always do this. Just be mindful of how long it takes to upgrade
your database (if you have that ability to do a dry run, you might ant to do
that). That’s true of any upgrade, though.
If you have to skip more than one version, you’ll have to upgrade in stages.
On Nov 10, 2022, at 7
des. Would
Slurm enforce limits properly ("qos" or "partition" limits)?
Kind Regards
--
#BlackLivesMatter
|| \\UTGERS, |--*O*----
||_// the State |Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Techno
topology plugin. We use
this to keep jobs from spanning two different infiniband fabrics that
are connected together via lower bandwidth than the rest of the fabric.
--
#BlackLivesMatter
|| \\UTGERS, |--*O*
||_// the State |Ryan Novo
much for pointing me in the correct direction.
Thanks,
Reed
On Jun 15, 2022, at 7:50 PM, Ryan Novosielski
mailto:novos...@rutgers.edu>> wrote:
Apologies for not having more concrete information available when I’m replying
to you, but I figured maybe having a fast hint might be better.
Apologies for not having more concrete information available when I’m replying
to you, but I figured maybe having a fast hint might be better.
Have a look at how the various daemons communicate with one another. This
sounds to me like a firewall thing between maybe the SlurmCtld and where the
S
I’m not 100% certain that this affects this situation, but there’s a slurm.conf
setting called EnforcePartLimits that you might want to change.
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski
e is lost. You don’t normally see that memory being
used like that, because slurmdbd is normally up/accepting the accounting data.
--
#BlackLivesMatter
|| \\UTGERS, |-------*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
||
, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MSB C630, Newark
`'
>
related to "squeue -O". May not work with Slurm 19.05 and older.
:04 04 dee11077f72dd898dcadccf9d0dd2cfc438a8d1f
61880fe14a49a7a96167b89d21dede41f2751d86 M pestat
> On Dec 14, 2021, at 4:29 PM, Ryan Novosielski wrote:
>
> Hi Ole,
>
> Thanks again for your great
date/time.
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced
* 128000 116325
You can see Joblist and JobID User are not present.
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922
rentice
>
>
> On 4/22/21 10:55 AM, Ryan Novosielski wrote:
>> My recollection is that this parameter is talking about “ulimit” parameters,
>> and doesn’t have to do with cgroups. The documentation is not as clear here
>> as it could be, about what this does, the mec
My recollection is that this parameter is talking about “ulimit” parameters,
and doesn’t have to do with cgroups. The documentation is not as clear here as
it could be, about what this does, the mechanism by which it’s applied (PAM
module), etc.
Sent from my iPhone
> On Apr 22, 2021, at 09:07
*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MSB C630, Newark
`'
> On Apr 16, 2021, at 6:21 PM, Juer
nning.
Anyway, I figure this is something people probably need to know often enough.
Any tips?
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist
, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MSB C630, Newark
`'
> On Feb 3, 2021, at
*---
||_// the State | Ryan Novosielski -
novos...@rutgers.edu<mailto:novos...@rutgers.edu>
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MSB C630, Newark
`'
On J
Thanks, that’s great! I do a lot of that by hand (including lots over this
weekend), so it will be a nice timesaver.
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski -
novos...@rutgers.edu
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski -
novos...@rutgers.edu<mailto:novos...@rutgers.edu>
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Re
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski -
novos...@rutgers.edu<mailto:novos...@rutgers.edu>
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Re
AFAIK, if you have this set up correctly, nvidia-smi will be restricted too,
though I think we were seeing a bug there at one time in this version.
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan
As root, -a is effectively applied to every command I’m aware of.
--
#BlackLivesMatter
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922
I’ve previously seen code contributed back in that way. See bug 1611 as an
example (happened to have looked at that just yesterday).
--
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski -
novos...@rutgers.edu
| Ryan Novosielski -
novos...@rutgers.edu<mailto:novos...@rutgers.edu>
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MSB C630, Newark
`'
On Sep 30, 2020, at 10:57, Relu Patrascu wrote:
Hi all
Absolutely not. It’s recommended.
--
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski -
novos...@rutgers.edu<mailto:novos...@rutgers.edu>
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS
, in the case of
mpirun, etc.).
--
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski -
novos...@rutgers.edu<mailto:novos...@rutgers.edu>
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS
NodeName=cuda[001-008] Name=gpu File=/dev/nvidia[2-3] CPUs=12-23
This also seems to be related:
https://slurm.schedmd.com/SLUG19/GPU_Scheduling_and_Cons_Tres.pdf
--
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos
.
--
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MSB C630, Newark
Sorry about that. “NJT” should have read “but;” apparently my phone decided I
was talking about our local transit authority. 😓
On Aug 25, 2020, at 10:30, Ryan Novosielski wrote:
I believe that’s done via a QoS on the partition. Have a look at the docs
there, and I think “require” is a good
, |---*O*---
||_// the State | Ryan Novosielski -
novos...@rutgers.edu<mailto:novos...@rutgers.edu>
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MS
*---
||_// the State | Ryan Novosielski -
novos...@rutgers.edu<mailto:novos...@rutgers.edu>
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MSB C630, Newark
`'
On Jul 2, 2020, at 09:5
getting it from the
VM somehow.
--
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced
heir issues. I have used the
>>> "export=DISPLAY, HOME" as an additional argument for srun but
>>> without any progress. Anyone with similiar problem who can aid
>>> or advice me on howto use the X11Forward feature? Any help is
>>> much appreciat
The node is not getting the status from itself, it’s querying the slurmctld to
ask for its status.
--
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973
Check slurm.conf for StateSaveLocation.
https://slurm.schedmd.com/slurm.conf.html
--
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922
| Ryan Novosielski -
novos...@rutgers.edu<mailto:novos...@rutgers.edu>
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MSB C630, Newark
`'
On Dec 11, 2019, at 22:41, Victor (Weikai)
#x27;m not sure it makes any difference here)
--
|| \\UTGERS,|---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office o
Do you mean akin to what some would consider "CPU efficiency" on a CPU job?
"How much... used" is a little vague.
From: slurm-users on behalf of Prentice
Bisbal
Sent: Thursday, November 14, 2019 13:41
To: Slurm User Community List
Subject: [slurm-users]
IS an interaction of
some sort.
--
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski -
novos...@rutgers.edu<mailto:novos...@rutgers.edu>
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS
du> O: 212-746-6305
> F: 212-746-8690
- --
____
|| \\UTGERS, |--*O*
||_// the State |Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 ~*~ RBHS Campus
|
epool separate to the processes
address space.
All the best,
Chris
--
Chris Samuel : http://www.csamuel.org/ : Berkeley, CA, USA
--
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski -
novos...@rutgers.edu<mai
want? I’m not so sure.
How soon will someone figure out that they might get a higher priority based on
requesting some feature they don’t need?
--
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
e have any ideas whether this can be made to work and, if
> so, how?
- --
|| \\UTGERS, |----------*O*
||_// the State |Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 ~*~ RBHS Campus
|| \\of NJ | Office of Ad
> On Mar 22, 2019, at 4:22 AM, Ole Holm Nielsen
> wrote:
>
> On 3/21/19 6:56 PM, Ryan Novosielski wrote:
>>> On Mar 21, 2019, at 12:21 PM, Loris Bennett
>>> wrote:
>>>
>>> Our last cluster only hit around 2.5 million jobs after
>>>
orting >24 hour database conversion times.
--
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanc
I’ve never seen a paycheck signed by “Best Practices”.
--
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MSB C630, Newark
`'
signature.asc
Description: Message signed with OpenPGP
pology/tree plugin.
>> """
>>
>> So the Topology plugin does take precedence over the weighting
>> algorithm, but it doesn't disable it, AFAIK. And for sites using
>> disjoint networks, as we do, this is a sane behavior.
>>
>> Cheers,
>
--
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MSB C630, Newark
`'
is intended for
>> serial and low-core count parallel jobs) If I just leave those nodes out of
>> the topology.conf file, will that have the desired affect of not allocating
>> multi-node jobs to those nodes, or will it result in an error of some sort?
--
|| \\UTG
ound that, I guess, but by default, the behavior seems
to be roughly the inverse of the node weights.
--
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MSB C630, Newark
`'
signature.asc
Description: Message signed with OpenPGP
I don’t actually know the answer to this one, but we have it provisioned to all
nodes.
Note that if you care about node weights (eg. NodeName=whatever001 Weight=2,
etc. in slurm.conf), using the topology function will disable it. I believe I
was promised a warning about that in the future in a
it’s going to ignore the topology plugin,
but I believe it works (and the documentation sure indicates it does).
--
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (
ver, what’s the advantage of “salloc --x11 srun” vs. just
"srun --x11”?
--
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MSB C630, Newark
`'
then occasionally send srun commands
over to it.
--
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ
m/cgroup.conf
> ConstrainCores=yes
> ConstrainRAMSpace=yes
> ConstrainSwapSpace=yes
>
> Cheers,
> Chris
>
> —
> Christopher Coffey
> High-Performance Computing
> Northern Arizona University
> 928-523-1167
>
>
--
|| \\UTGERS, |
cpu of a job?
>>
>> Hello everyone,
>>
>> How to check the percent cpu of a job in slurm? I tried sacct, sstat,
>> squeue, but I can't find that how to check.
>> Can someone help me?
>>
>> Best regards,
>> Yalei
>>
--
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus
|| \\of NJ | Office of Advanced Research Computing - MSB C630, Newark
`'
Thanks Olm! I am quite fond of your utilities — thank you for providing them.
Sent from my iPhone
> On Nov 21, 2018, at 08:51, Ole Holm Nielsen
> wrote:
>
> Dear Slurm users,
>
> The Slurm tool "pestat" (Processor Element status) has been enhanced due to a
> user request. Now pestat will d
set
to offline such nodes, but that affects job preemption. What sort of
choices do others make in this area?
- --
|| \\UTGERS, |------*O*
||_// the State |Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973/9
/s ExtSensorsWatts=0 ExtSensorsTemp=n/s
Reason=HDRT #1019681 [root@2018-08-06T12:14:44]
Thanks!
--
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski - novos...@rutgers.edu
|| \\ University | Sr. Technologist - 973
> On Jul 23, 2018, at 10:31 PM, Ian Mortimer wrote:
>
> On Tue, 2018-07-24 at 02:19 +0000, Ryan Novosielski wrote:
>
>> Best off running nvidia-persistenced. Handles all of this stuff as a
>> side effect, and also enables persistence mode, provided you don’t
>> con
Best off running nvidia-persistenced. Handles all of this stuff as a side
effect, and also enables persistence mode, provided you don’t configure it
otherwise.
--
|| \\UTGERS, |---*O*---
||_// the State | Ryan Novosielski
1 - 100 of 120 matches
Mail list logo