On 01/04/2017 01:15 AM, Klaus Wenninger wrote:
On 01/03/2017 09:02 PM, Muhammad Sharfuddin wrote:
On 01/03/2017 09:49 PM, Kristoffer Grönlund wrote:
Muhammad Sharfuddin <[email protected]> writes:

Hello,

Pacemaker does not start on this machine (Fujitsu PRIMERGY RX2540 M1),
with the following error in the logs:

sbd: [13236]: ERROR: Cannot open watchdog device: /dev/watchdog: No such file or directory
Does /dev/watchdog exist?
No
ls -l /dev/watch*
ls: cannot access /dev/watch*: No such file or directory
Then you probably don't have one at all.
Maybe there is no hardware, the driver is not loaded, or
udev doesn't create the node for some reason.
As a test, at least, you can try loading softdog.

[kwenning@kwenning pacemaker]$ sudo modprobe softdog
[sudo] password for kwenning:
[kwenning@kwenning pacemaker]$ ls -l /dev/watchdog
crw-------. 1 root root 10, 130 Jan  3 13:39 /dev/watchdog
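
The checks discussed above (is there a device node, is a watchdog driver loaded) can be sketched as two small shell helpers. This is only an illustration: the function names list_watchdog_nodes and filter_watchdog_modules are made up here and are not part of sbd or pacemaker, and the module filter looks at the module-name column only, so it is slightly stricter than a plain egrep over the whole lsmod line.

```shell
#!/bin/sh
# Sketch: report watchdog device nodes and watchdog-related kernel modules.
# Function names are illustrative, not part of sbd or pacemaker.

# Print every watchdog node found under the given directory (default: /dev).
list_watchdog_nodes() {
    dir="${1:-/dev}"
    for node in "$dir"/watchdog*; do
        # An unmatched glob stays literal, so check that the path exists.
        [ -e "$node" ] && printf '%s\n' "$node"
    done
    return 0
}

# Read lsmod-style text on stdin and print the names of modules whose
# own name contains "wd" or "dog" (first column only).
filter_watchdog_modules() {
    awk '$1 ~ /(wd|dog)/ { print $1 }'
}

# Example use on a live system:
#   list_watchdog_nodes
#   lsmod | filter_watchdog_modules
```

If list_watchdog_nodes prints nothing while filter_watchdog_modules does list a driver, the driver is loaded but no device node was created, which is one of the cases mentioned above.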
On the node where I am getting the sbd error, loading the "softdog" module fixes the issue.

Now the strange part: this is happening on a single node only, i.e. only one
node reports the error:
sbd: ERROR: Cannot open watchdog device: /dev/watchdog: No such file or directory

On another node (100% identical: same OS, software, configuration, and hardware), /dev/watchdog is also missing, yet pacemaker and sbd start there without any
error and without loading the "softdog" module.
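
One way to look for a difference between two supposedly identical nodes is to compare their loaded kernel modules. The following is a minimal sketch, assuming the output of lsmod (including its header line) has been saved to one file per node; the function names and the file names in the usage comment are hypothetical.

```shell
#!/bin/sh
# Sketch: compare the kernel modules loaded on two nodes.
# Assumes each argument is a file holding saved `lsmod` output.

# Print module names from lsmod-style input, one per line, sorted.
module_names() {
    # Skip the "Module Size Used by" header line.
    awk 'NR > 1 { print $1 }' "$1" | sort
}

# Show modules present in only one of the two listings.
# Unindented lines: only in the first file.
# Tab-indented lines: only in the second file.
compare_modules() {
    a=$(mktemp) && b=$(mktemp) || return 1
    module_names "$1" > "$a"
    module_names "$2" > "$b"
    comm -3 "$a" "$b"
    rm -f "$a" "$b"
}

# Example use (file names are placeholders):
#   compare_modules node1.lsmod node2.lsmod
```

An empty result means both nodes have the same set of modules loaded, in which case the difference has to be elsewhere, for example in the sbd configuration.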

If you have more than one watchdog device, you can configure
sbd to use a different device using the -w option.

Cheers,
Kristoffer

System Info:

sbd-1.2.1-8.7.x86_64  corosync-2.3.3-7.12.x86_64 pacemaker-1.1.12-7.1.x86_64

lsmod | egrep "(wd|dog)"
iTCO_wdt               13480  0
iTCO_vendor_support    13718  1 iTCO_wdt

dmidecode | grep -A3 '^System Information'
System Information
          Manufacturer: FUJITSU
          Product Name: PRIMERGY RX2540 M1
          Version: GS01

logs:

2017-01-03T21:00:26.890503+05:00 prdnode1 sbd: [13235]: info: Watchdog enabled.
2017-01-03T21:00:26.899817+05:00 prdnode1 sbd: [13238]: info: Servant starting for device /dev/disk/by-id/wwn-0x600000e00d280000002825b500000000-part1
2017-01-03T21:00:26.900175+05:00 prdnode1 sbd: [13238]: info: Device /dev/disk/by-id/wwn-0x600000e00d280000002825b500000000-part1 uuid: fda42d64-ca74-4578-90c8-976ea7ff5f6e
2017-01-03T21:00:26.900418+05:00 prdnode1 sbd: [13239]: info: Monitoring Pacemaker health
2017-01-03T21:00:27.901022+05:00 prdnode1 sbd: [13236]: ERROR: Cannot open watchdog device: /dev/watchdog: No such file or directory
2017-01-03T21:00:27.912098+05:00 prdnode1 sbd: [13236]: WARN: Servant for pcmk (pid: 13239) has terminated
2017-01-03T21:00:27.941950+05:00 prdnode1 sbd: [13236]: WARN: Servant for /dev/disk/by-id/wwn-0x600000e00d280000002825b500000000-part1 (pid: 13238) has terminated
2017-01-03T21:00:27.949401+05:00 prdnode1 sbd.sh[13231]: sbd failed; please check the logs.
2017-01-03T21:00:27.992606+05:00 prdnode1 sbd.sh[13231]: SBD failed to start; aborting.
2017-01-03T21:00:27.993061+05:00 prdnode1 systemd[1]: sbd.service: control process exited, code=exited status=1
2017-01-03T21:00:27.993339+05:00 prdnode1 systemd[1]: Failed to start Shared-storage based fencing daemon.
2017-01-03T21:00:27.993610+05:00 prdnode1 systemd[1]: Dependency failed for Pacemaker High Availability Cluster Manager.
2017-01-03T21:00:27.994054+05:00 prdnode1 systemd[1]: Unit sbd.service entered failed state.

Please help.

--
Regards,

Muhammad Sharfuddin
<http://www.nds.com.pk>

_______________________________________________
Users mailing list: [email protected]
http://lists.clusterlabs.org/mailman/listinfo/users

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org