[slurm-users] gres definitions

2020-12-14 Thread john abignail
Hi, I have gres defined in my partition name. If I assign some bogus gres to a node, the partition stops working. Somehow Slurm and the OS agree on the gres installed. How do I find out all the named gres in my system, e.g. specific cpu types, not just "cpu". Thanks, John

[slurm-users] slurmctld daemon error

2020-12-14 Thread Alpha Experiment
Hi, I am trying to run slurm on Fedora 33. Upon boot the slurmd daemon is running correctly; however the slurmctld daemon always errors. [admin@localhost ~]$ systemctl status slurmd.service ● slurmd.service - Slurm node daemon Loaded: loaded (/etc/systemd/system/slurmd.service; enabled; vendo

Re: [slurm-users] slurmctld daemon error

2020-12-14 Thread Luke Yeager
What does your ‘slurmctld.service’ look like? You might want to add something to the ‘After=’ section if your service is starting too quickly. e.g. we use ‘After=network.target munge.service’ (see here

Re: [slurm-users] slurmctld daemon error

2020-12-14 Thread Avery Grieve
Hey Luke, I'm getting the same issues with my slurmctld daemon not starting on boot (as well as my slurmd daemon). Both fail with the same messages John got above (just Exit Code). My slurmctld service file in /etc/systemd/system/ looks like this: [Unit] Description=Slurm controller daemon After=

Re: [slurm-users] slurmctld daemon error

2020-12-14 Thread Alpha Experiment
Hi Luke and Avery, Changed the After line in the slurmctld.service file to After=network.target munge.service slurmd.service This seemed to do the trick! Best, John On Mon, Dec 14, 2020 at 6:10 PM Avery Grieve wrote: > Hey Luke, I'm getting the same issues with my slurmctld daemon not > start

Re: [slurm-users] slurmctld daemon error

2020-12-14 Thread Brian Andrus
Check your hosts file and ensure 'localhost' does not have an IPV6 address associated with it. Brian Andrus On 12/14/2020 4:19 PM, Alpha Experiment wrote: Hi, I am trying to run slurm on Fedora 33. Upon boot the slurmd daemon is running correctly; however the slurmctld daemon always errors.

Re: [slurm-users] slurmctld daemon error

2020-12-14 Thread Alpha Experiment
Hi Brian, My hosts file looks like this: 127.0.0.1 localhost localhost.localdomain localhost4 localhost4.localdomain4 ::1 localhost localhost.localdomain localhost6 localhost6.localdomain6 I believe the second is an IPV6 address. Is it safe to delete that line? Best, John On Mon, Dec

[slurm-users] Scripts run slower in slurm?

2020-12-14 Thread Alpha Experiment
Hi, I made a short script in python to test if slurm was correctly limiting the number of CPUs available to each partition. The script is as follows: import multiprocessing as mp import time as t def fibonacci(n): n = int(n) def fibon(a,b,n,result): c = a+b result.append(c