[slurm-users] Integrating Fabric Modules with SLURM – Best Practices for HPC Networking?

2025-07-15 Thread xoyeyor178--- via slurm-users
? Performance Gains – Measurable improvements in job throughput or MPI communication? Troubleshooting – Known conflicts with SLURM’s network topology detection? -- slurm-users mailing list -- slurm-users@lists.schedmd.com To unsubscribe send an email to slurm-users-le...@lists.schedmd.com

[slurm-users] Re: Slurm-wlm depends on X11?

2025-07-10 Thread Jesse Hayward via slurm-users
>> >>> Jess, please feel free to reach out to me directly via Email or give me >>> a call. I would like to hear a bit more. >>> >>> Cheers all, >>> Jesse >>> >>> On Thu, Jun 26, 2025 at 6:42 AM Kevin Buckley < >>> kevin.buckley.

[slurm-users] Re: Slurm-wlm depends on X11?

2025-07-10 Thread Jesse Hayward via slurm-users
se feel free to reach out to me directly via Email or give me a >> call. I would like to hear a bit more. >> >> Cheers all, >> Jesse >> >> On Thu, Jun 26, 2025 at 6:42 AM Kevin Buckley < >> kevin.buckley.pawsey.org...@gmail.com> wrote: >> >>

[slurm-users] Re: Slurm-wlm depends on X11?

2025-07-10 Thread Jesse Hayward via slurm-users
> libgdk-x11-2.0.so.0()(64bit) >> libgdk_pixbuf-2.0.so.0()(64bit) >> libgtk-x11-2.0.so.0()(64bit) >> libpango-1.0.so.0()(64bit) >> libpangocairo-1.0.so.0()(64bit) >> libpangoft2-1.0.so.0()(64bit) >> >> but, sadly, they are all requirements for the "main" >> >> slurm >> >> package, which deploys sview, and not, as one might hope >> >> slurm-contribs >> >> where you might hope to find them, or even completely >> off-side in some (currently non-existing) >> >> slurm-contribs-gui >> >> package.

[slurm-users] Is DefCpuPerGPU affected by --ntasks-per-gpu?

2025-07-10 Thread Milad Alizadeh via slurm-users
e job didn't specify --cpus-per-task and --cpus-per-gpu" which I do not specify, at least not directly. If yes is there a way to set something like DefCpuPerTask?

[slurm-users] Setting memory by assigned node with a plugin

2025-07-09 Thread laddaoui--- via slurm-users
this information and modify the memory allocation accordingly? Or is there another approach you would recommend for dynamically setting memory based on the assigned node? Any insights would be greatly appreciated. Best regards, Nacereddine

[slurm-users] Slurm versions 25.05.1 and 24.11.6 are now available

2025-07-08 Thread Marshall Garey via slurm-users
- Commercial Slurm Development and Support

[slurm-users] Energy profile data not recorded in InfluxDB or HDF5

2025-07-02 Thread Xand Meaden via slurm-users
working? I really hope I'm missing something obvious here! Thanks, Xand -- Xand Meaden | Principal Research Infrastructure Engineer e-Research | King's College London

[slurm-users] Re: Slurm-wlm depends on X11?

2025-07-01 Thread Jesse Hayward via slurm-users
ick up the chrooted image. Jess, please feel free to reach out to me directly via Email or give me a call. I would like to hear a bit more. Cheers all, Jesse On Thu, Jun 26, 2025 at 6:42 AM Kevin Buckley < kevin.buckley.pawsey.org...@gmail.com> wrote: > On 2025/06/25 23:39, Jesse Hayward vi

[slurm-users] sbatch strange behavior with different --nodelist (-w) options

2025-06-30 Thread Xinghong He via slurm-users
Xinghong

[slurm-users] extern step does not exit

2025-06-27 Thread william--- via slurm-users
system. I am not sure whether the error messages about setgroups and _shutdown_x11_forward are actually the problem, or just something else being reported. The only system that I have to compare with is running Slurm 19.05 on CentOS 7 and is rather different. I would be interested to know if anyone else has had problems with extern job steps not shutting down. William

[slurm-users] Re: Slurm-wlm depends on X11?

2025-06-25 Thread Kevin Buckley via slurm-users
On 2025/06/25 23:39, Jesse Hayward via slurm-users wrote: Does slurm actually depend on x11 now? Or do I need to take a look at my apt config and see what's going on over there, instead. TL;DR: it doesn't have to - but you might need to build your own DEB-files Looking at the

[slurm-users] Re: Slurm-wlm depends on X11?

2025-06-25 Thread Christopher Samuel via slurm-users
On 6/25/25 11:39 am, Jesse Hayward via slurm-users wrote: Does slurm actually depend on x11 now? My guess would be that is coming from the sview package, which is an X11 program. Why that would be getting pulled in is what I think you'd want to look for. https://slurm.schedm

[slurm-users] Slurm-wlm depends on X11?

2025-06-25 Thread Jesse Hayward via slurm-users
land). * Does slurm actually depend on x11 now? Or do I need to take a look at my apt config and see what's going on over there, instead. Thanks! Jess -- Jesse Hayward Systems Administrator for High Performance Computing Vassar College 845.437.7521 CIS 207

[slurm-users] enforce Qos to users

2025-06-24 Thread laddaoui--- via slurm-users
to enforce QoS restrictions properly? Best, --- info about my setup slurm version : tested on 23.11.4 and 23.02.7 AccountingStorageEnforce = associations,limits EnforcePartLimits = ALL

[slurm-users] Re: [EXTERNAL] enforce Qos to users

2025-06-24 Thread Greg Wickham via slurm-users
Do you have “AccountingStorageEnforce” defined? We use: AccountingStorageEnforce=associations,limits,qos -Greg On 24/06/2025, 16:17, "laddaoui--- via slurm-users" wrote: Hello everyone, I'm trying to use QoS to enforce resource limits on an association, but I'm

[slurm-users] Re: Job information if job is completed

2025-06-24 Thread Prentice Bisbal via slurm-users
data was in /var/lib/slurm, or something like that, and accounting information was in a mysql DB, so two different data formats. Prentice On 6/17/25 7:42 AM, Davide DelVento via slurm-users wrote: Yeah, that's an annoying thing which I have not understood why has been designed that way. The

[slurm-users] Re: Implementing a "soft" wall clock limit

2025-06-24 Thread Prentice Bisbal via slurm-users
led 5 minutes before it finishes after running for 72 hours is also wasteful of computer time. It's a balancing act. Yes there's checkpointing, but that's way outside the scope of this conversation. Prentice On 6/17/25 8:15 AM, Davide DelVento via slurm-users wrote: This

[slurm-users] Re: [EXTERNAL] enforce Qos to users

2025-06-24 Thread laddaoui--- via slurm-users
Thanks! It works perfectly now. Nacereddine
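
In the QoS thread above, the original setup listed `AccountingStorageEnforce = associations,limits` and the suggested fix was to add `qos` to that list. The following is a minimal sketch, not an official Slurm tool, of how one might check a slurm.conf text for that flag before restarting anything. The function name `enforce_flags` is hypothetical, and the sketch assumes simple `Key=Value` lines, `#` comments, case-insensitive keys, and that the last occurrence of the key wins.

```python
def enforce_flags(conf_text: str) -> set[str]:
    """Return the lower-cased AccountingStorageEnforce flags found in slurm.conf text.

    Hypothetical helper: assumes plain Key=Value lines and '#' comments.
    """
    flags: set[str] = set()
    for raw in conf_text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop trailing comments
        if "=" not in line:
            continue
        key, value = line.split("=", 1)
        if key.strip().lower() == "accountingstorageenforce":
            # Assumption: a later occurrence replaces an earlier one.
            flags = {f.strip().lower() for f in value.split(",") if f.strip()}
    return flags


# The two settings quoted in the thread.
before = "AccountingStorageEnforce = associations,limits\nEnforcePartLimits = ALL\n"
after = "AccountingStorageEnforce=associations,limits,qos\n"

print("qos" in enforce_flags(before))  # False
print("qos" in enforce_flags(after))   # True
```

On a live system the same value can be read from the running configuration instead of the file; the sketch only covers the text check.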

[slurm-users] Re: pam_slurm_adopt - ssh to compute nodes not working in slurm 24.11

2025-06-23 Thread Christopher Samuel via slurm-users
On 6/23/25 5:56 am, Marx, Wolfgang via slurm-users wrote: Is this a known bug in Slurm 24.11 and has anyone facing the same problem. Working fine here with 24.11.5. Just to be sure, your compute nodes are using pam_slurm_adopt from 24.11.5 and not an earlier version by accident? All the

[slurm-users] read-only slurm user

2025-06-23 Thread Hagdorn, Magnus Karl Moritz via slurm-users
Campus Charité Mitte BALTIC - Invalidenstraße 120/121 10115 Berlin https://www.charite.de HPC Helpdesk: sc-hpc-helpd...@charite.de

[slurm-users] pam_slurm_adopt - ssh to compute nodes not working in slurm 24.11

2025-06-23 Thread Marx, Wolfgang via slurm-users
, Basisdienste, Gruppe Hochleistungrechnen Technische Universität Darmstadt, Hochschulrechenzentrum Alexanderstraße 2, 64283 Darmstadt Tel.: +496151/16-71158 E-Mail: wolfgang.m...@tu-darmstadt.de Web: www.hrz.tu-darmstadt.de

[slurm-users] Re: Implementing a "soft" wall clock limit

2025-06-20 Thread Christopher Samuel via slurm-users
On 6/20/25 2:36 pm, Smith, Sebastian via slurm-users wrote: I forced my users to specify time limits and they quickly adapted: You can also set a default (and maximum) time limit per partition, our default time limits are set to 10 minutes, QOS's have limits in between the partition de

[slurm-users] Re: Implementing a "soft" wall clock limit

2025-06-20 Thread Smith, Sebastian via slurm-users
that, recurring, crime scene… I forced my users to specify time limits and they quickly adapted: `JobSubmitPlugins=require_timelimit` Good luck! Sebastian -- From: Michael DiDomenico via slurm-users Date: Tuesday, June 17, 2025 at 06:20 To: Cc: slurm-users@lists.schedmd.com Subject: [slurm

[slurm-users] Doc Clarification: Heterogeneous Steps in Heterogeneous Job

2025-06-19 Thread Steffen Christgau via slurm-users
? Am I just using a lucky example which is actually not supported? A short clarification would be helpful. Thanks in advance Steffen

[slurm-users] Re: Wrong MaxRSS Behavior with cgroup v2 in Slurm

2025-06-19 Thread Guillaume COCHARD via slurm-users
ehavior with cgroup v2 in Slurm What kernel are you using? I had a similar issue in an older RHEL 9 kernel which has now been fixed Sean From: Guillaume COCHARD via slurm-users Sent: Thursday, May 22, 2025 7:10:11 PM To: slurm-us...@schedmd.com Subject: [EXT] [slurm-users] Wrong MaxR

[slurm-users] Re: Job information if job is completed

2025-06-18 Thread Gestió Servidors via slurm-users
"sacct: error: Unknown error 1064" How could I give permissions to user-sacct to allow the "sacct -B" command? Thanks.

[slurm-users] Re: Job information if job is completed

2025-06-18 Thread Daniel Letai via slurm-users
13:29, Carsten Beyer via slurm-users wrote: Hi @all, we use another solution for our finished jobs, everything goes additionally to Elastic (incl. jobscript), see JobCompLoc and JobCompType. We don't store the jobscripts in our SLURM Acc

[slurm-users] Re: Job information if job is completed

2025-06-18 Thread Carsten Beyer via slurm-users
esellschaft: Hamburg Amtsgericht Hamburg HRB 39784 On 18.06.25 at 11:55, Gestió Servidors via slurm-users wrote: Hi again, Oh, sorry! Yes, I configured slurm.conf with “AccountingStoreFlags=job_comment,job_env,job_extra,job_script” but I could test… Now, I have tested and I have got my su

[slurm-users] Re: Job information if job is completed

2025-06-18 Thread Gestió Servidors via slurm-users
From: Guillaume COCHARD Sent: Wednesday, 18 June 2025 10:45 To: Gestió Servidors Cc: slurm-users@lists.schedmd.com Subject: Re: [slurm-users] Re: Job information if j

[slurm-users] Re: Job information if job is completed

2025-06-18 Thread Guillaume COCHARD via slurm-users
https://slurm.schedmd.com/sacct.html#OPT_batch-script ) as pointed out by Eric in his response. Cheers, Guillaume From: "Gestió Servidors via slurm-users" To: slurm-users@lists.schedmd.com Sent: Wednesday, 18 June 2025

[slurm-users] Re: Job information if job is completed

2025-06-18 Thread Gestió Servidors via slurm-users
However, how could I copy the submit script from the prolog script? Because from the prolog script, I can access some SLURM variables (https://slurm.schedmd.com/prolog_epilog.html) but I don't know how to know what the script is and run a simple "cp" to a destination folder. Tha

[slurm-users] Re: Job information if job is completed

2025-06-17 Thread Paul Edmon via slurm-users
out everyone accepting that their existing tools will break in favor of unifying all the command structures. -Paul Edmon- On 6/17/2025 7:42 AM, Davide DelVento via slurm-users wrote: Yeah, that's an annoying thing which I have not understood why has been designed that way. The informati

[slurm-users] Re: Implementing a "soft" wall clock limit

2025-06-17 Thread Michael DiDomenico via slurm-users
lVento via slurm-users wrote: > > This conversation is drifting a bit away from my initial questions and > covering various other related topics. In fact I do agree with almost > everything written in the last few messages. However, that is somewhat > orthogonal to my ini

[slurm-users] Re: Job information if job is completed

2025-06-17 Thread Ole Holm Nielsen via slurm-users
lVento via slurm-users wrote: Yeah, that's an annoying thing which I have not understood why has been designed that way. The information is there and it can be queried, just with a different command and spitting it out in a different format. The syntax is sacct -j  XXX which gives you only

[slurm-users] Re: Implementing a "soft" wall clock limit

2025-06-17 Thread Davide DelVento via slurm-users
ing the various perspectives at different sites! On Tue, Jun 17, 2025 at 12:26 AM Loris Bennett via slurm-users < slurm-users@lists.schedmd.com> wrote: > Hi Prentice, > > Prentice Bisbal via slurm-users > > writes: > > > I think the idea of havin

[slurm-users] Re: Job information if job is completed

2025-06-17 Thread Davide DelVento via slurm-users
ields,you,want -j XXX the (super long) list of possible fields (case insensitive) can be queried with sacct -e HTH On Tue, Jun 17, 2025 at 4:45 AM Gestió Servidors via slurm-users < slurm-users@lists.schedmd.com> wrote: > Hello, > > > > Is there any way to get all informat
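
The replies in this thread point to `sacct` for completed jobs: `sacct -j` with a chosen field list, `sacct -e` to list the available fields, and the batch-script option linked above. As a rough sketch only, the helpers below assemble those command lines as argument lists; the function names and the job ID `12345` are hypothetical, and the field names are just a sample of what `sacct -e` reports. Per the thread, the batch script is only retrievable when `AccountingStoreFlags` includes `job_script`.

```python
import shlex


def sacct_fields_cmd(job_id: str, fields: list[str]) -> list[str]:
    """Build a 'sacct -j <id> --format=<fields>' argument list (hypothetical helper)."""
    return ["sacct", "-j", job_id, "--format=" + ",".join(fields)]


def sacct_script_cmd(job_id: str) -> list[str]:
    """Build the argument list that asks sacct for the stored batch script of a job."""
    return ["sacct", "-j", job_id, "--batch-script"]


# Long form of 'sacct -e': prints the fields accepted by --format.
LIST_FIELDS_CMD = ["sacct", "--helpformat"]

# Example job ID and field selection; both are placeholders.
fields_cmd = sacct_fields_cmd("12345", ["JobID", "JobName", "State", "Elapsed", "MaxRSS"])
script_cmd = sacct_script_cmd("12345")

# Print shell-ready command lines; nothing is executed here.
for cmd in (LIST_FIELDS_CMD, fields_cmd, script_cmd):
    print(shlex.join(cmd))
```

On a host with the Slurm client tools installed, each list could be handed to `subprocess.run(cmd, capture_output=True, text=True)`; building the list separately keeps the sketch runnable without a cluster.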

[slurm-users] Re: Job information if job is completed

2025-06-17 Thread Eric Le Lay via slurm-users
available via sacct. Hope this helps. Cheers, On 17/06/2025 at 12:42, Gestió Servidors via slurm-users wrote: Hello, Is there any way to get all information (like submit script or submit node) from a job that is completed? Something like “scontrol show jobid=XXX” when job is “running” or

[slurm-users] Job information if job is completed

2025-06-17 Thread Gestió Servidors via slurm-users
Hello, Is there any way to get all information (like submit script or submit node) from a job that is completed? Something like "scontrol show jobid=XXX" when job is "running" or "pending". I need to inspect the submit script of a job but I only know job_id. Tha

[slurm-users] Re: [EXT] Re: slurm_pam_adopt module not working

2025-06-16 Thread William Brown via slurm-users
Fritz via slurm-users, < slurm-users@lists.schedmd.com> wrote: > Yes the file exists in /usr/lib64/security/. > Best, > > > *Fritz Ratnasamy* Data Scientist > Information Technology > > > > > On Tue, Jun 17, 2025 at 12:17 AM Sean Crosby > wrote: > >

[slurm-users] Re: Implementing a "soft" wall clock limit

2025-06-16 Thread Loris Bennett via slurm-users
Hi Prentice, Prentice Bisbal via slurm-users writes: > I think the idea of having a generous default timelimit is the wrong way to > go. In fact, I think any defaults for jobs are a bad way to go. The majority > of your > users will just use that default time limit, and backfil

[slurm-users] Re: [EXT] Re: slurm_pam_adopt module not working

2025-06-16 Thread Ratnasamy, Fritz via slurm-users
allow anyone > to log in. > > Sean > ------ > *From:* Ratnasamy, Fritz via slurm-users > *Sent:* Tuesday, 17 June 2025 14:55 > *To:* Kevin Buckley > *Cc:* slurm-users@lists.schedmd.com > *Subject:* [EXT] [slurm-users] Re: slurm_pam_adopt module not working > > * External email:

[slurm-users] Re: [EXT] Re: slurm_pam_adopt module not working

2025-06-16 Thread Sean Crosby via slurm-users
doesn't exist, pam would abnormally exit and not allow anyone to log in. Sean From: Ratnasamy, Fritz via slurm-users Sent: Tuesday, 17 June 2025 14:55 To: Kevin Buckley Cc: slurm-users@lists.schedmd.com Subject: [EXT] [slurm-users] Re: slurm_pam_adopt modul

[slurm-users] Re: slurm_pam_adopt module not working

2025-06-16 Thread Ratnasamy, Fritz via slurm-users
-auth session include postlogin # Used with polkit to reauthorize users in remote sessions -session optional pam_reauthorize.so prepare *Fritz Ratnasamy* Data Scientist Information Technology On Wed, Jun 11, 2025 at 8:29 PM Kevin Buckley via slurm-users < slurm-us

[slurm-users] Re: MIG H100 with xeon Intel

2025-06-16 Thread Patryk Bełzak via slurm-users
Which hardware platform is this? We've had the same issue on Dell with H100 even without MIG setup, we've had to restart the slurmd daemon after boot in order to make sure that everything is fine. Patryk. On 25/06/12 01:46, Richard Lefebvre via slurm-users wrote:

[slurm-users] Re: Implementing a "soft" wall clock limit

2025-06-16 Thread Prentice Bisbal via slurm-users
at they're doing will give them an advantage! My 4 cents (adjusted for inflation). Prentice On 6/12/25 9:11 PM, Davide DelVento via slurm-users wrote: Sounds good, thanks for confirming it. Let me sleep on it wrt the "too many" QOS, or think if I should ditch this idea. If I'

[slurm-users] Re: Implementing a "soft" wall clock limit

2025-06-12 Thread Davide DelVento via slurm-users
user select one (or indeed select > it from lua) but as far as I know, there is no way to set arbitrary > per-job values. > > Best, > > A. > -- > Ansgar Esztermann > Sysadmin Dep. Theoretical and Computational Biophysics > https://www.mpinat.mpg.de/person/1

[slurm-users] MIG H100 with xeon Intel

2025-06-12 Thread Richard Lefebvre via slurm-users
Parameters=l3cache_as_socket (recommended) or override this by manually specifying core affinity in gres.conf. I did set l3cache_as_socket in the slurm.conf of the node, but I still get the error on the slurm controller. I'm running 24.11.5 on AlmaLinux 9.5 Richard

[slurm-users] Re: Implementing a "soft" wall clock limit

2025-06-12 Thread Ansgar Esztermann-Kirchner via slurm-users
t arbitrary per-job values. Best, A. -- Ansgar Esztermann Sysadmin Dep. Theoretical and Computational Biophysics https://www.mpinat.mpg.de/person/11315/3883774

[slurm-users] How are the results produced by 'seff'?

2025-06-12 Thread Loris Bennett via slurm-users
t the memory usage reported by 'seff' is unreliable [2]. Is that indeed the case? Cheers, Loris Footnotes: [1] https://github.com/PrincetonUniversity/jobstats [2] https://doc.dhpc.tudelft.nl/delftblue/Slurm-trouble-shooting/ -- Dr. Loris Bennett (Herr/Mr) FUB-IT, Freie Universit

[slurm-users] Re: Implementing a "soft" wall clock limit

2025-06-12 Thread Davide DelVento via slurm-users
I'm thinking that the user could use the regular wallclock limit setting in slurm and the script could remove that and use it to set the PreemptExemptTime. Thanks, Davide On Thu, Jun 12, 2025 at 3:56 AM Ansgar Esztermann-Kirchner via slurm-users < slurm-users@lists.schedmd.com> wrote: >

[slurm-users] Re: Implementing a "soft" wall clock limit

2025-06-12 Thread Ansgar Esztermann-Kirchner via slurm-users
PreemptExemptTime has passed. See https://slurm.schedmd.com/preempt.html Best, A. -- Ansgar Esztermann Sysadmin Dep. Theoretical and Computational Biophysics https://www.mpinat.mpg.de/person/11315/3883774

[slurm-users] Re: slurm_pam_adopt module not working

2025-06-11 Thread Kevin Buckley via slurm-users
On 2025/06/11 12:46, Ratnasamy, Fritz via slurm-users wrote: We wanted to block users from ssh to a node where there are no jobs running however it looks like users are able to do so. We have installed the slurm_pam_adopt_module and set up the slurm.conf accordingly (the same way we did on our

[slurm-users] Re: Implementing a "soft" wall clock limit

2025-06-11 Thread Loris Bennett via slurm-users
for individual jobs, when requested. We also don't pre-empt any jobs. Apart from that, I imagine implementing your 'soft' limits robustly might be quite challenging and/or time-consuming, as I am not aware that Slurm has anything like that built in. Cheers, Loris > On Wed,

[slurm-users] Re: Implementing a "soft" wall clock limit

2025-06-11 Thread Davide DelVento via slurm-users
dering too. On Wed, Jun 11, 2025 at 6:15 AM Loris Bennett via slurm-users < slurm-users@lists.schedmd.com> wrote: > Hi Davide, > > Davide DelVento via slurm-users > writes: > > > In the institution where I work, so far we have managed to live > > without mandatory

[slurm-users] Re: Implementing a "soft" wall clock limit

2025-06-11 Thread Loris Bennett via slurm-users
Hi Davide, Davide DelVento via slurm-users writes: > In the institution where I work, so far we have managed to live > without mandatory wallclock limits (a policy decided well before I > joined the organization), and that has been possible because the > cluster was not very m

[slurm-users] Implementing a "soft" wall clock limit

2025-06-11 Thread Davide DelVento via slurm-users
vance for any suggestions you may provide!

[slurm-users] Re: CR_CPU used but only cores used

2025-06-11 Thread Adrian Sevcenco via slurm-users
Adrian On Jun 11, 2025 09:13, Daniel Letai via slurm-users wrote: How are the nodes configured? The question is of CPU definition - is it cores? threads? sockets? That depends on node configuration. See https://slurm.schedmd.com/slurm.conf.

[slurm-users] Re: CR_CPU used but only cores used

2025-06-10 Thread Daniel Letai via slurm-users
core. On 11/06/2025 0:53, Adrian Sevcenco via slurm-users wrote: Hi! I have a weird situation in which only cores are used instead of CPUs. This is Alma9/slurm 22.05.9 (the last one from epel). I have: c

[slurm-users] Re: slurm_pam_adopt module not working

2025-06-10 Thread Ole Holm Nielsen via slurm-users
On 6/11/25 06:46, Ratnasamy, Fritz via slurm-users wrote:  We wanted to block users from ssh to a node where there are no jobs running however it looks like users are able to do so. We have installed the slurm_pam_adopt_module and set up the slurm.conf accordingly (the same way we did on our

[slurm-users] slurm_pam_adopt module not working

2025-06-10 Thread Ratnasamy, Fritz via slurm-users
Scientist Information Technology

[slurm-users] CR_CPU used but only cores used

2025-06-10 Thread Adrian Sevcenco via slurm-users
!! Adrian

[slurm-users] Problem querying slurm batch script archive

2025-06-09 Thread Davide DelVento via slurm-users
ering this odd behavior? Thanks!

[slurm-users] Re: Restrict and prioritize usage of certain nodes according to accounts

2025-06-09 Thread Prentice Bisbal via slurm-users
as possible, though, it will minimize headaches for you. I did work at one place where a Slurm user could only belong to one account at a time to prevent those sort of issues. Prentice On 5/21/25 4:25 AM, thomas.hartmann--- via slurm-users wrote: Hi, I'm going to have the following sit

[slurm-users] Re: seff command not found

2025-06-09 Thread Kevin Buckley via slurm-users
On 2025/06/07 08:58, David Schanzenbach via slurm-users wrote: I am not sure what OS you are working with, but on our system running Rocky 9, the seff command  is found in the slurm-contribs rpm. Identical situation on a SLES 15 system; # which sacct /usr/bin/sacct # rpm -qf /usr/bin/sacct

[slurm-users] Re: seff command not found

2025-06-06 Thread David Schanzenbach via slurm-users
I am not sure what OS you are working with, but on our system running Rocky 9, the seff command  is found in the slurm-contribs rpm. Thanks, David On 6/6/2025 1:09 PM, Ratnasamy, Fritz via slurm-users wrote: Hi, We installed a new slurm version and it returns "command not found"

[slurm-users] seff command not found

2025-06-06 Thread Ratnasamy, Fritz via slurm-users
erl script. Best, *Fritz Ratnasamy* Data Scientist Information Technology

[slurm-users] Re: Integration testing of Slinky images

2025-06-06 Thread Nicolas Bock via slurm-users
at you have enough of them for the script(s) you wish to run. > Brian Andrus

[slurm-users] Re: X11 Forwarding Crashes X Session for too large windows

2025-06-06 Thread Felix Winterhalter via slurm-users
connection that transfers the data has problems. Cheers, Felix On 6/6/25 12:13 AM, Felix Winterhalter via slurm-users wrote:

[slurm-users] Re: Integration testing of Slinky images

2025-06-06 Thread Brian Andrus via slurm-users
n on using (eg: gpus) and that you have enough of them for the script(s) you wish to run. Brian Andrus On 6/6/2025 12:31 PM, Nicolas Bock via slurm-users wrote: Hi, I was wondering whether there is any integration testing done on the containers and the operator published under the

[slurm-users] Integration testing of Slinky images

2025-06-06 Thread Nicolas Bock via slurm-users
Hi, I was wondering whether there is any integration testing done on the containers and the operator published under the slinky project. Or asked differently, once I have deployed slinky, what tests should I run to confirm that the cluster is working as intended? Thanks! Nick

[slurm-users] Re: X11 performance terrible using plugin

2025-06-06 Thread Davide DelVento via slurm-users
> The issue isn’t network bandwidth Latency. The issue with X is always latency, not bandwidth. On Fri, Jun 6, 2025 at 8:57 AM Simon Andrews via slurm-users < slurm-users@lists.schedmd.com> wrote: > Thanks for the suggestions – some interesting links to read. We already > have a

[slurm-users] Re: X11 performance terrible using plugin

2025-06-06 Thread Davide DelVento via slurm-users
th by papercuts scenario. On Fri, Jun 6, 2025 at 7:09 AM Burian, John via slurm-users < slurm-users@lists.schedmd.com> wrote: > We’ve been using TurboVNC. > > > > *From: *Hadrian Djohari via slurm-users > *Date: *Friday, June 6, 2025 at 8:41 AM > *To: *John Hea

[slurm-users] Re: X11 performance terrible using plugin

2025-06-06 Thread John Hearns via slurm-users
Stupid reply. Reverse name resolution. But surely that would only affect the initial setup of a connection not the data stream. I suggest strace of a graphical application on a computer node. You might be able to spot where any hangs are On Fri, Jun 6, 2025, 3:57 PM Simon Andrews via slurm-users

[slurm-users] Re: X11 performance terrible using plugin

2025-06-06 Thread Simon Andrews via slurm-users
: John Hearns ; Simon Andrews ; slurm-users@lists.schedmd.com Subject: Re: [slurm-users] Re: X11 performance terrible using plugin

[slurm-users] Re: X11 performance terrible using plugin

2025-06-06 Thread Burian, John via slurm-users
We’ve been using TurboVNC. From: Hadrian Djohari via slurm-users Date: Friday, June 6, 2025 at 8:41 AM To: John Hearns Cc: Simon Andrews , slurm-users@lists.schedmd.com Subject: [slurm-users] Re: X11 performance terrible using plugin Or use Open OnDemand platform for the interactive Desktop

[slurm-users] Re: X11 performance terrible using plugin

2025-06-06 Thread Jason Simms via slurm-users
-8102 On Fri, Jun 6, 2025 at 8:40 AM Hadrian Djohari via slurm-users < slurm-users@lists.schedmd.com> wrote: > Or use Open OnDemand platform for the interactive Desktop. > https://openondemand.org/ > > On Fri, Jun 6, 2025 at 8:37 AM John Hearns via slurm-users < > slurm

[slurm-users] Re: X11 performance terrible using plugin

2025-06-06 Thread Hadrian Djohari via slurm-users
Or use Open OnDemand platform for the interactive Desktop. https://openondemand.org/ On Fri, Jun 6, 2025 at 8:37 AM John Hearns via slurm-users < slurm-users@lists.schedmd.com> wrote: > Simon, I have had success in the past by using NICE DCV (now owned by AWS > but you can get lic

[slurm-users] Re: X11 performance terrible using plugin

2025-06-06 Thread John Hearns via slurm-users
, Simon Andrews via slurm-users < slurm-users@lists.schedmd.com> wrote: > On our cluster we’ve noticed that if we use the native x11 slurm plugin > (PrologFlags=x11) then X applications work, but are really slow and > unresponsive. Even opening menus on graphical application is p

[slurm-users] X11 performance terrible using plugin

2025-06-06 Thread Simon Andrews via slurm-users
head node the same applications are quick and responsive. Any suggestions for what might be causing this, and how I can get the native x11 to have the same responsiveness as a direct ssh connection? Many thanks Simon.

[slurm-users] Re: Transfer from GPFS via slurm

2025-06-05 Thread Ryan Novosielski via slurm-users
bandwidth fairly. Sent from my iPhone On Jun 6, 2025, at 00:31, Ratnasamy, Fritz via slurm-users wrote: Hi, We were told by our hardware provider that large datasets copied from NFS location to GPFS could be conducted via slurm to monitor the transfer. I am not sure if this works as I

[slurm-users] Transfer from GPFS via slurm

2025-06-05 Thread Ratnasamy, Fritz via slurm-users
copy files from on a cluster between NFS and GPFS? Best, *Fritz Ratnasamy* Data Scientist Information Technology

[slurm-users] X11 Forwarding Crashes X Session for too large windows

2025-06-05 Thread Felix Winterhalter via slurm-users

[slurm-users] Re: Changing account names in sacctmgr

2025-06-05 Thread Thomas M. Payerle via slurm-users
nt; verify expected account defaulting when using sbatch, examine the defaultaccount for the user (sacctmgr show user), then change the name of the group w/out changing the defaultaccount and verify account defaulting works as expected. On Thu, Jun 5, 2025 at 3:42 AM Bjørn-Helge Mevik via slurm-users <

[slurm-users] Re: Changing account names in sacctmgr

2025-06-05 Thread Burian, John via slurm-users
As evidenced by numerous responses, my understanding was incorrect. Thanks all for setting me straight, saved me quite a bit of trouble. John From: Thomas M. Payerle via slurm-users Date: Thursday, June 5, 2025 at 9:40 AM To: Bjørn-Helge Mevik , slurm-us...@schedmd.com Subject: [slurm-users

[slurm-users] Re: Changing account names in sacctmgr

2025-06-05 Thread Daniel Letai via slurm-users
- it might not be necessary at all. Just my 2 cents. > On 4 Jun 2025, at 17:11, Burian, John via slurm-users wrote: > > All, > > For reasons I won’t go into, we find we need to change the POSIX group names > for all of our users. My unde

[slurm-users] Re: Changing account names in sacctmgr

2025-06-05 Thread Bjørn-Helge Mevik via slurm-users
"Burian, John via slurm-users" writes: > My understanding is that in the absence of > an --account option, sbatch/salloc assumes the account is the user’s > primary POSIX group. Are you sure about that? I've never heard of such defaulting. My understanding is that w

[slurm-users] Changing account names in sacctmgr

2025-06-04 Thread Burian, John via slurm-users
the associations with the new names and take my lumps on the accounting history? John

[slurm-users] error building debug packages on ubuntu 24.04

2025-06-03 Thread Charles Hedrick via slurm-users
It was unable to find lib/security/pam_slurm_adopt.so. It was built as lib64/security, not lib/security. ./debian/slurm-smd-libpam-slurm-adopt.install needs to be lib64/security/pam_slurm_adopt.so. More likely, it needs to be built dynamically, so it will work everywhere.

[slurm-users] Re: Restrict and prioritize usage of certain nodes according to accounts

2025-06-02 Thread Daniel Letai via slurm-users
ailable in partition project_A_part, the job will be allocated both nodes from general_part, and there is no guarantee the free project_A_ node will be used. HTH, --Dani_L. On 22/05/2025 9:14, Bjørn-Helge Mevik via slurm-u

[slurm-users] Slurm version 25.05 is now available

2025-05-29 Thread Tim Wickberg via slurm-users
Commercial Slurm Development and Support

[slurm-users] Re: Slurm accounts managed via ansible

2025-05-28 Thread Ole Holm Nielsen via slurm-users
available at https://github.com/OleHolmNielsen/Slurm_tools/tree/master/slurmaccounts On 5/28/25 06:24, Ratnasamy, Fritz via slurm-users wrote: I was wondering whether there might be built-in support for managing Slurm accounts, users, associations in Ansible. It would be nice to be able t

[slurm-users] Re: Slurm accounts managed via ansible

2025-05-27 Thread John Hearns via slurm-users
?tab=readme-ov-file#8-slurm On Wed, 28 May 2025 at 05:28, Ratnasamy, Fritz via slurm-users < slurm-users@lists.schedmd.com> wrote: > Hi, > > I was wondering whether there might be built-in support for managing > Slurm accounts, users, associations in Ansible. It would be ni

[slurm-users] Slurm accounts managed via ansible

2025-05-27 Thread Ratnasamy, Fritz via slurm-users
r2" *Fritz Ratnasamy* Data Scientist Information Technology

[slurm-users] Account getting duplicated

2025-05-27 Thread Ratnasamy, Fritz via slurm-users
Hi, the Slurm DB duplicates all our account associations: one with cluster=cluster and another where cluster=venus (which is our actual cluster). Is that intended? Or should I make any changes? *Fritz Ratnasamy* Data Scientist Information Technology

[slurm-users] Re: Account getting duplicated

2025-05-27 Thread Brian Andrus via slurm-users
If you have multiple clusters and don't specify which one, it will add you to each. That requires multiple entries. Brian Andrus On 5/27/2025 6:20 PM, Ratnasamy, Fritz via slurm-users wrote: Hi, the slurm db duplicates all our account associations. one with cluster=cluster and another

[slurm-users] Re: Slurm Reporting Difference between sreport and sacct

2025-05-23 Thread Paul Raines via slurm-users
ume there is NO truncation going on here and those month numbers include the full run time of every job that ran at any time in that month, including time that falls in the previous or next month. -- Paul Raines (http://help.nmr.mgh.harvard.edu) On Fri, 23 May 2025 8:32am, Passant Hafez via slurm-users wr

[slurm-users] Re: Slurm Reporting Difference between sreport and sacct

2025-05-23 Thread Passant Hafez via slurm-users
option ensures that only the time within the defined period is included. Maybe this can explain some of the discrepancy you experience. Best regards, Steen ____ From: Passant Hafez via slurm-users Sent: Wednesday, May 21, 2025 18:48 To: 'slurm-us...@schedmd.com

[slurm-users] Re: Crash in "slurmd -C" when latest NVIDIA drivers are used

2025-05-22 Thread Brian Andrus via slurm-users
. Unfortunately, the installation of the latest nvidia-cuda-toolkit did not help. slurmd -C still crashes. Best regards, Taras *From:* Brian Andrus via slurm-users *Sent:* Tuesday, May 20, 2025 11:22 *To:* slurm-users

[slurm-users] Re: Crash in "slurmd -C" when latest NVIDIA drivers are used

2025-05-22 Thread Taras Shapovalov via slurm-users
Thank you, Brian. Unfortunately, the installation of the latest nvidia-cuda-toolkit did not help. slurmd -C still crashes. Best regards, Taras From: Brian Andrus via slurm-users Sent: Tuesday, May 20, 2025 11:22 To: slurm-users@lists.schedmd.com Subject
