from:"Pablo Escobar"

[slurm-users] bug when using SlurmctldParameters=cloud_reg_addrs ? error: get_name_info: getnameinfo() failed: Name or service not known

2021-10-25 Thread Pablo Escobar Lopez

Hi, I have configured Slurm cloud scheduling for OpenStack. I am using CentOS 7 with Slurm version 20.11.8 installed from EPEL RPMs. It is working fine, but I am getting some strange errors in the Slurm master logs which I think are a bug. I am using these options in slurm.conf: SlurmctldParamet

Re: [slurm-users] Can I get the original sbatch command, after the fact?

2021-07-17 Thread Pablo Escobar Lopez

You can check the sarchive tool. https://archive.fosdem.org/2020/schedule/event/job_script_archival/ https://github.com/itkovian/sarchive Regards, Pablo. On Fri, Jul 16, 2021 at 8:29 PM Paul Edmon wrote: > Not in the current version of Slurm. In the next major version long > term storage of j

[slurm-users] examples or docs about Slurm cloud bursting on OpenStack ?

2021-07-07 Thread Pablo Escobar Lopez

Hi, I am exploring the option to use the Slurm elastic computing support ( https://slurm.schedmd.com/elastic_computing.html ) together with the Slurm configless support ( https://slurm.schedmd.com/configless_slurm.html ) to deploy dynamic Slurm clusters on OpenStack which can automatically grow an

Re: [slurm-users] How to deal with user running stuff in frontend node?

2018-02-15 Thread Pablo Escobar

Hi Manuel, A possible workaround is to configure a per-user cgroups limit on the frontend node so a single user cannot allocate more than 1 GB of RAM (or whatever value you prefer). The user would still be able to abuse the machine, but as soon as their memory usage goes above the limit their job will be

[slurm-users] slow sacct queries after upgrading to 17.11.0

2017-12-18 Thread Pablo Escobar

Hi, We have upgraded from 17.02.3 to 17.11.0, and since the upgrade we have noticed that a simple "sacct -j $jobid" takes much longer than before. Before the upgrade sacct was nearly immediate; now it takes around 1 minute. After enabling the slow query log in MariaDB we have found this slow qu

5 matches
