Re: [slurm-users] slurmctld daemon error

2020-12-15 Thread mercan
Oh, yes! sorry, I confuse with the your email and Alpha Experiment's emails. Ahmet M. 15.12.2020 21:59 tarihinde Avery Grieve yazdı: Hi Ahmet, Thank you for your suggestion. I assume you're talking about the SlurmctldHost field in the slurm.conf file? If so, I've got that variable defined

Re: [slurm-users] slurmctld daemon error

2020-12-15 Thread mercan
Hi; I dont know the problem is this, but, I think the setting "ControlMachine=localhost" and not setting a hostname for slurm master node are not good decisions. How compute nodes decide the ip address of the slurm masternode from "localhost". Also, I suggest not using capital letters for any

Re: [slurm-users] slurmctld daemon error

2020-12-15 Thread Avery Grieve
I changed my .service file to write to a log. The slurm daemons are running (manual start) on the compute nodes. I get this on startup with the service enabled: [2020-12-15T18:09:06.412] slurmctld version 20.11.1 started on cluster cluster [2020-12-15T18:09:06.539] No memory enforcing mechanism co

Re: [slurm-users] slurmctld daemon error

2020-12-15 Thread Avery Grieve
Maybe a silly question, but where do you find the daemon logs or specify their location? ~Avery Grieve They/Them/Theirs please! University of Michigan On Mon, Dec 14, 2020 at 7:22 PM Alpha Experiment wrote: > Hi, > > I am trying to run slurm on Fedora 33. Upon boot the slurmd daemon is > runni

Re: [slurm-users] slurmctld daemon error

2020-12-14 Thread Alpha Experiment
Hi Brian, My hosts file looks like this: 127.0.0.1 localhost localhost.localdomain localhost4 localhost4.localdomain4 ::1 localhost localhost.localdomain localhost6 localhost6.localdomain6 I believe the second is an IPV6 address. Is it safe to delete that line? Best, John On Mon, Dec

Re: [slurm-users] slurmctld daemon error

2020-12-14 Thread Brian Andrus
Check your hosts file and ensure 'localhost' does not have an IPV6 address associated with it. Brian Andrus On 12/14/2020 4:19 PM, Alpha Experiment wrote: Hi, I am trying to run slurm on Fedora 33. Upon boot the slurmd daemon is running correctly; however the slurmctld daemon always errors.

Re: [slurm-users] slurmctld daemon error

2020-12-14 Thread Alpha Experiment
Hi Luke and Avery, Changed the After line in the slurmctld.service file to After=network.target munge.service slurmd.service This seemed to do the trick! Best, John On Mon, Dec 14, 2020 at 6:10 PM Avery Grieve wrote: > Hey Luke, I'm getting the same issues with my slurmctld daemon not > start

Re: [slurm-users] slurmctld daemon error

2020-12-14 Thread Avery Grieve
Hey Luke, I'm getting the same issues with my slurmctld daemon not starting on boot (as well as my slurmd daemon). Both fail with the same messages John got above (just Exit Code). My slurmctld service file in /etc/systemd/system/ looks like this: [Unit] Description=Slurm controller daemon After=

Re: [slurm-users] slurmctld daemon error

2020-12-14 Thread Luke Yeager
What does your ‘slurmctld.service’ look like? You might want to add something to the ‘After=’ section if your service is starting too quickly. e.g. we use ‘After=network.target munge.service’ (see here