Dear Chris,

I could not find this warning in the slurm.conf man page. So I googled
it and found a reference in the Slurm developers documentation: 

https://slurm.schedmd.com/jobacct_gatherplugins.html

However, this web page says in its footer: "Last modified 27 March 2015". 
So maybe (means: hopefully) this caveat is somewhat outdated today. 

I have also `JobAcctGatherType=jobacct_gather/cgroup´ in my slurm.conf 
but for no deeper reason than that we also use cgroups for
process tracking (i.e. ProctrackType=proctrack/cgroup) and to limit 
resources used by users. So it just felt more consistent to me to 
use cgroups for jobacct_gather plugin as well - even though SchedMD 
recommends jobacct_gather/linux (according to the slurm.conf man page)

That said, I'd also be interested in the pros and cons of jobacct_gather/cgroup 
versus jobacct_gather/linux and also why jobacct_gather/linux is the recommended
one.

Best regards
Jürgen

-- 
Jürgen Salk
Scientific Software & Compute Services (SSCS)
Kommunikations- und Informationszentrum (kiz)
Universität Ulm
Telefon: +49 (0)731 50-22478
Telefax: +49 (0)731 50-22471





* Christopher Benjamin Coffey <chris.cof...@nau.edu> [191022 16:26]:
> Hi,
> 
> We've been using jobacct_gather/cgroup for quite some time and haven't had 
> any issues (I think). We do see some lengthy job cleanup times when there are 
> lots of small jobs completing at once, maybe that is due to the cgroup 
> plugin. At SLUG19 a slurm dev presented information that the 
> jobacct_gather/cgroup plugin has quite the performance hit and that 
> jobacct_gather/linux should be set instead. 
> 
> Can someone help me with the difference between these two gather plugins? If 
> one were to switch to jobacct_gather/linux, what are the cons? Do you lose 
> some job resource usage information?
> 
> Checking out the docs again on schedmd site regarding the jobacct_gather 
> plugins I see:
> 
> cgroup — Gathers information from Linux cgroup infrastructure and adds this 
> information to the standard rusage information also gathered for each job. 
> (Experimental, not to be used in production.)
> 
> I don't believe I saw that before: "Experimental" ! Hah.
> 
> Thanks!
> 
> Best,
> Chris
>  
> -- 
> Christopher Coffey
> High-Performance Computing
> Northern Arizona University
> 928-523-1167
>  
>  
> 

-- 
GPG A997BA7A | 87FC DA31 5F00 C885 0DC3  E28F BD0D 4B33 A997 BA7A

Reply via email to