Re: [slurm-users] can't lengthen my jobs log

2020-11-12 Thread Erik Bryer
That worked pretty well in that I got more data than I ever have before by a lot. It only goes back about 18 days, but I'm not sure why. The slurmdbd.conf back then contained no directives on retaining logs, which is supposed to mean it defaults to retaining them indefinitely. On another test cl

Re: [slurm-users] can't lengthen my jobs log

2020-11-12 Thread Sebastian T Smith
Hi John, Have you tried specifying a start time? The default is 00:00:00 of the current day (depending on other options). Example: sacct -S 2020-11-01T00:00:00 Our accounting database retains all job data from the epoch of our system. Best, Sebastian -- [University of Nevada, Reno]

[slurm-users] can't lengthen my jobs log

2020-11-12 Thread john abignail
Hi, My jobs database empties after about 1 day. "sacct -a" returns no results. I've tried to lengthen that, but have been unsuccessful. I've tried adding the following to slurmdbd.conf and restarting slurmdbd: ArchiveJobs=yes PurgeEventAfter=1month PurgeJobAfter=12month PurgeResvAfter=1month Purge

[slurm-users] Slurm versions 20.02.6 and 19.05.8 are now available (CVE-2020-27745 and CVE-2020-27746)

2020-11-12 Thread Tim Wickberg
Slurm versions 20.11.0rc2, 20.02.6 and 19.05.8 are now available, and include a series of recent bug fixes, as well as a fix for two security issues. Note: the 19.05 release series is nearing the end of it's support lifecycle as we prepare to release 20.11 later this month. The 19.05.8 downlo

Re: [slurm-users] failed to send msg type 6002: No route to host

2020-11-12 Thread Patrick Bégou
Hi Marcus thanks for your contact. I'm new to slurm deployment and I do not remember where I found this command to check slurm setup. The SallocDefaultCommand is not defined in my slurm.conf file That is strange for me is that it works on the node hosting slurmctld, and on the compute node too.

Re: [slurm-users] failed to send msg type 6002: No route to host

2020-11-12 Thread Pocina, Goran
I think this message can also happen if the slurm.conf on your login node is missing the entry for the slurmd node. 2020 versions have a way to automate sync of the configuration. From: slurm-users On Behalf Of Patrick Bégou Sent: Thursday, November 12, 2020 7:38 AM To: slurm-users@lists.sche

Re: [slurm-users] failed to send msg type 6002: No route to host

2020-11-12 Thread Sean Maxwell
Hi Patrick, I have seen a similar error while configuring native X-forwarding in Slurm. It was caused by Slurm sending an IP to the compute node (as part of a message) that was not routable back to the controller host. In my case it was because the controller host was multihomed, and I had misconf

Re: [slurm-users] failed to send msg type 6002: No route to host

2020-11-12 Thread Marcus Wagner
Hi Patrick, for me at least, this is running as expected. I'm not sure, why you use "sh" as the command for salloc, I never saw that before. If you do not provide a command, the users default shell will be started if the "SallocDefaultCommand" is not set within slurm.conf So, what does $> sa

Re: [slurm-users] failed to send msg type 6002: No route to host

2020-11-12 Thread Patrick Bégou
Hi slurm admins and developpers, no one has an idea about this problem ? Still investigating this morning I discover that it works from the management node (a small VM running slurmctld) even if I have no home directory on it (I use a su command from root to gain unprivileged user setup). It stil

Re: [slurm-users] Raise the priority of a certain kind of jobs

2020-11-12 Thread Zacarias Benta
You can create a QOS with more priority, you can also create a specific partition with highter priority. On 12/11/2020 09:58, SJTU wrote: Hello, We want to raise the priority of a certain kind of slurm jobs. We considered doing it in Prolog, but Prolog seems to run only at job starting time

Re: [slurm-users] Raise the priority of a certain kind of jobs

2020-11-12 Thread Marcus Boden
Hi, you could write a job_submit plugin: https://slurm.schedmd.com/job_submit_plugins.html The Site factor was added to priority for that exact reason. Best, Marcus On 11/12/20 10:58 AM, SJTU wrote: > Hello, > > We want to raise the priority of a certain kind of slurm jobs. We considered > do

Re: [slurm-users] Raise the priority of a certain kind of jobs

2020-11-12 Thread Ole Holm Nielsen
On 11/12/20 10:58 AM, SJTU wrote: Hello, We want to raise the priority of a certain kind of slurm jobs. We considered doing it in Prolog, but Prolog seems to run only at job starting time so may not be useful for queued jobs. Is there any possible way to do this? You can add a negative "nic

[slurm-users] Raise the priority of a certain kind of jobs

2020-11-12 Thread SJTU
Hello, We want to raise the priority of a certain kind of slurm jobs. We considered doing it in Prolog, but Prolog seems to run only at job starting time so may not be useful for queued jobs. Is there any possible way to do this? Thank you and look forward to your reply. Best, Jianwen