Re: [slurm-users] [External] Use nodes exclusive and shared simultaneously

2021-03-05 Thread Prentice Bisbal
I have what you're requesting set up in my environment. Preemption is the only way to go. And you don't want to suspend the preempted jobs - you want to requeue them. If you suspend them, they just stop executing but still stay in memory, so the amount of memory needed by the preempting job may not be
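A minimal slurm.conf sketch of the requeue-based preemption described above, assuming partition-priority preemption. The partition names, node ranges, and priority values are hypothetical and are not taken from the poster's configuration:

```
# slurm.conf fragment (illustrative only; names and values are assumptions)
PreemptType=preempt/partition_prio   # higher PriorityTier partitions preempt lower ones
PreemptMode=REQUEUE                  # preempted jobs are requeued, not suspended
JobRequeue=1                         # allow batch jobs to be requeued by default

# Shared partition: jobs here may be preempted and requeued.
PartitionName=shared Nodes=node[01-10] PriorityTier=1  PreemptMode=REQUEUE Default=YES
# Owner partition: overlaps some of the same nodes and wins on priority.
PartitionName=owner  Nodes=node[01-04] PriorityTier=10 PreemptMode=OFF
```

Requeued jobs restart from the beginning unless the application checkpoints itself, so this trade-off is worth explaining to users of the shared partition.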

Re: [slurm-users] [External] Get original script of a job

2021-03-05 Thread Prentice Bisbal
I have this same problem - a user writes to me about a problem they're having with Slurm, and instead of pasting the script into the support ticket or e-mail, they give me the path to it. When I finally get to looking at it, it's obvious they've changed it since reporting the issue, so I have no for

Re: [slurm-users] [External] About sacct --format: detailed info about the fields

2021-03-05 Thread Prentice Bisbal
Which fields are you talking about? All of the fields seem to be well-documented in the sacct man page (https://slurm.schedmd.com/sacct.html). For example, you mention the time fields. The format for all the time fields seems to be rather well documented (see below) and you can use the SLURM_TIM

Re: [slurm-users] NVML autodetect "Failed to get supported memory frequencies" error

2021-03-05 Thread Joshua Baker-LePain
On Fri, 5 Mar 2021 at 7:51am, Kilian Cavalotti wrote Hi Joshua, On Thu, Mar 4, 2021 at 8:38 PM Joshua Baker-LePain wrote: slurmd: error: _nvml_get_mem_freqs: Failed to get supported memory frequencies slurmd: error: for the GPU : Not Supported slurmd: 4 GPU system device(s) detected slu

Re: [slurm-users] NVML autodetect "Failed to get supported memory frequencies" error

2021-03-05 Thread Kilian Cavalotti
Hi Joshua, On Thu, Mar 4, 2021 at 8:38 PM Joshua Baker-LePain wrote: > slurmd: error: _nvml_get_mem_freqs: Failed to get supported memory > frequencies > slurmd: error: for the GPU : Not Supported > slurmd: 4 GPU system device(s) detected > slurmd: WARNING: The following autodetected GPUs a

[slurm-users] Use nodes exclusive and shared simultaneously

2021-03-05 Thread Heckes, Frank
Hi all, Sorry if this has been asked and answered before. Resulting from a user/owner requirement, I would need to set up a subset of nodes in a partition to still be used as common shared resources, but these nodes should be available with the smallest latency possible in case the owner wants to run a

Re: [slurm-users] Get original script of a job

2021-03-05 Thread Carl Ponder
I put this line in my job-control file (written in bash) to capture the original as part of the run: cp $0 $RUNDIR/$SLURM_JOB_NAME The $0 gives the full path to the working copy of the script, so it expands to this for example: /fs/slurm/var/spool/job67842/slurm_script It depends on t
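The `cp $0` approach above can be wrapped in a small helper, sketched here under some assumptions: the function name `save_job_script` and the destination file naming are hypothetical, and the fallback values only exist so the function also works outside a Slurm job.

```shell
#!/bin/bash
# Sketch: snapshot the script that is actually running into a run directory,
# so later edits to the user's original file do not affect the saved copy.
# save_job_script is a hypothetical helper name, not part of Slurm.
save_job_script() {
    local src="$1"        # script to archive (pass "$0" from a batch script)
    local dest_dir="$2"   # run directory, e.g. "$RUNDIR"
    # SLURM_JOB_NAME and SLURM_JOB_ID are set by Slurm inside a job;
    # the defaults are placeholders for use outside a job.
    local job_name="${SLURM_JOB_NAME:-job}"
    local job_id="${SLURM_JOB_ID:-0}"
    local dest="$dest_dir/${job_name}.${job_id}.sh"

    mkdir -p "$dest_dir" || return 1
    # Print the destination path only if the copy succeeded.
    cp -- "$src" "$dest" && printf '%s\n' "$dest"
}
```

Inside a batch script this would be called as `save_job_script "$0" "$RUNDIR"`. Including the job ID in the file name keeps copies from repeated submissions of the same job name from overwriting each other.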

Re: [slurm-users] Get original script of a job

2021-03-05 Thread Ward Poelmans
Hi, On 5/03/2021 11:29, Alberto Morillas, Angelines wrote: > I know that when I send a job with scontrol I can get the path and the > name of the script used to send this job, but normally the users change > their scripts and sometimes all was wrong after that, so is there any > possibility to rep

Re: [slurm-users] slurm-users Digest, Vol 41, Issue 13

2021-03-05 Thread Alberto Morillas, Angelines
Thanks! From: slurm-users on behalf of slurm-users-requ...@lists.schedmd.com Sent: Friday, March 5, 2021 1:00:01 PM To: slurm-users@lists.schedmd.com Subject: slurm-users Digest, Vol 41, Issue 13 Send slurm-users mailing list submissions to slurm-use

Re: [slurm-users] Get original script of a job

2021-03-05 Thread Ole Holm Nielsen
On 05-03-2021 11:29, Alberto Morillas, Angelines wrote: I would like to know if it will be possible to get the script that was used to send a job. I know that when I send a job with scontrol I can get the path and the name of the script used to send this job, but normally the users change their

[slurm-users] Get original script of a job

2021-03-05 Thread Alberto Morillas, Angelines
Hi, I would like to know if it will be possible to get the script that was used to send a job. I know that when I send a job with scontrol I can get the path and the name of the script used to send this job, but normally the users change their scripts and sometimes all was wrong after that, so i

Re: [slurm-users] About sacct --format: how can I get info about the fields

2021-03-05 Thread xiaojingh...@163.com
Hi, Marcus, Very grateful for your reply! I will try it out just as you suggested. Best Regards, Xiaojing > On Mar 5, 2021, at 3:59 PM, Marcus Boden wrote: > > Hi Xiaojing, > > my experience here is: you will have to try it out and see what works. At > least that's what I do whenever I parse

Re: [slurm-users] About sacct --format: how can I get info about the fields

2021-03-05 Thread Marcus Boden
Hi Xiaojing, my experience here is: you will have to try it out and see what works. At least that's what I do whenever I parse sacct, as I did not find a detailed description anywhere. The manpage is quite incomplete in that regard. Best, Marcus On 05.03.21 03:02, xiaojingh...@163.com wrote