Public bug reported:
autopkgtest logs:
https://objectstorage.prodstack5.canonical.com/swift/v1/AUTH_0f9aae918d5b4744bf7b827671c86842/autopkgtest-questing/questing/amd64/p/prometheus-hacluster-exporter/20250819_152305_86a4a@/log.gz
I tried it locally with set -ex, and also a bunch of extra `echo`
statements, and found nothing failing explicitly. It is believed it's
likely a kernel bug:
```
autopkgtest [14:41:48]: test sbd: [-----------------------
+ DISK=/tmp/autopkgtest.SPvFUS/autopkgtest_tmp/disk
+ echo /tmp/autopkgtest.SPvFUS/autopkgtest_tmp/disk
+ EXPORT=/tmp/autopkgtest.SPvFUS/autopkgtest_tmp/export
+ trap cleanup 0 2 3 15
+ printf \n=== losetup ===\n
+ dd if=/dev/zero of=/tmp/autopkgtest.SPvFUS/autopkgtest_tmp/disk bs=1M count=10
/tmp/autopkgtest.SPvFUS/autopkgtest_tmp/disk
=== losetup ===
+ losetup --find --show /tmp/autopkgtest.SPvFUS/autopkgtest_tmp/disk
10+0 records in
10+0 records out
10485760 bytes (10 MB, 10 MiB) copied, 0.00912326 s, 1.1 GB/s
+ LOOP=/dev/loop0
+ printf \n=== create ===\n
+ sbd -d /dev/loop0 create
=== create ===
+ printf \n=== allocate ===\n
+ hostname
+ Initializing device /dev/loop0
Creating version 2.1 header on device 3 (uuid:
4eaba40a-d505-4951-8058-ee6221807b20)
Initializing 255 slots on device 3
Device /dev/loop0 is initialized.
Did you check sbd service down on all nodes before? If not do so now and
restart afterwards.
=== allocate ===
sbd -d /dev/loop0 allocate autopkgtest
+ printf \n=== list ===\n
+ sbd -d /dev/loop0 list
Trying to allocate slot for autopkgtest on device /dev/loop0.
slot 0 is unused - trying to own
Slot for autopkgtest has been allocated on /dev/loop0.
=== list ===
0 autopkgtest clear
=== watch ===
+ printf \n=== watch ===\n
+ mkdir -p /var/lib/pacemaker/cores
+ sbd -d /dev/loop0 -W -W watch
+ pgrep -a sbd
+ printf \n=== dump ===\n
+ sbd -d /dev/loop0 dump
1988 /bin/sh /tmp/autopkgtest.SPvFUS/build.LUR/real-tree/debian/tests/sbd
2001 sbd: inquisitor
2002 sbd: watcher: /dev/loop0 - slot: 0 - uuid:
4eaba40a-d505-4951-8058-ee6221807b20
2003 sbd: watcher: Pacemaker
2004 sbd: watcher: Cluster
=== dump ===
+ printf \n=== exporter ===\n
+ echo SBD_DEVICE=/dev/loop0
+ service ha_cluster_exporter restart
==Dumping header on disk /dev/loop0
Header version : 2.1
UUID : 4eaba40a-d505-4951-8058-ee6221807b20
Number of slots : 255
Sector size : 512
Timeout (watchdog) : 5
Timeout (allocate) : 2
Timeout (loop) : 1
Timeout (msgwait) : 10
==Header on disk /dev/loop0 is dumped
=== exporter ===
+ service ha_cluster_exporter status
+ echo 0
+ seq 1 10
+ ss state listening sport = :9664
+ grep -q ^tcp
● ha_cluster_exporter.service - Prometheus exporter for Pacemaker HA clusters
metrics
Loaded: loaded (/usr/lib/systemd/system/ha_cluster_exporter.service;
enabled; preset: enabled)
Active: active (running) since Wed 2025-08-20 14:41:52 BST; 22ms ago
Invocation: 490e407dd09f44918ada75a4e6208065
Main PID: 2012 (ha_cluster_expo)
Tasks: 5 (limit: 2335)
Memory: 1.9M (peak: 1.9M)
CPU: 8ms
CGroup: /system.slice/ha_cluster_exporter.service
└─2012 /usr/bin/ha_cluster_exporter
Aug 20 14:41:52 autopkgtest ha_cluster_exporter[2012]:
ts=2025-08-20T13:41:52.189Z caller=main.go:274 level=info msg="Using config
file: /etc/ha_cluster_exporter.yaml"
Aug 20 14:41:52 autopkgtest ha_cluster_exporter[2012]:
ts=2025-08-20T13:41:52.189Z caller=main.go:280 level=warn msg="Registration
failure" err="could not initialize 'pacemaker' collector: '/usr/sbin/crm_mon'
does not exist"
Aug 20 14:41:52 autopkgtest ha_cluster_exporter[2012]:
ts=2025-08-20T13:41:52.189Z caller=main.go:280 level=warn msg="Registration
failure" err="could not initialize 'corosync' collector:
'/usr/sbin/corosync-cfgtool' does not exist"
Aug 20 14:41:52 autopkgtest ha_cluster_exporter[2012]:
ts=2025-08-20T13:41:52.189Z caller=main.go:280 level=warn msg="Registration
failure" err="could not initialize 'drbd' collector: '/sbin/drbdsetup' does not
exist"
Aug 20 14:41:52 autopkgtest ha_cluster_exporter[2012]:
ts=2025-08-20T13:41:52.189Z caller=main.go:288 level=info msg="sbd collector
registered."
Aug 20 14:41:52 autopkgtest ha_cluster_exporter[2012]:
ts=2025-08-20T13:41:52.190Z caller=main.go:328 level=info msg="Serving metrics
on 0.0.0.0:9664/metrics"
Aug 20 14:41:52 autopkgtest ha_cluster_exporter[2012]:
ts=2025-08-20T13:41:52.190Z caller=main.go:349 level=warn msg="Reading web
config file failed" err="stat /etc/ha_cluster_exporter.web.yaml: no such file
or directory"
Aug 20 14:41:52 autopkgtest ha_cluster_exporter[2012]:
ts=2025-08-20T13:41:52.191Z caller=main.go:350 level=info msg="Default web
config or commandline values will be used"
Aug 20 14:41:52 autopkgtest ha_cluster_exporter[2012]:
ts=2025-08-20T13:41:52.191Z level=info caller=tls_config.go:353 msg="Listening
on" address=[::]:9664
Aug 20 14:41:52 autopkgtest ha_cluster_exporter[2012]:
ts=2025-08-20T13:41:52.192Z level=info caller=tls_config.go:356 msg="TLS is
disabled." http2=false address=[::]:9664
0
=== export ===
+ break
+ printf \n=== export ===\n
+ curl --silent --show-error http://localhost:9664/metrics
# HELP ha_cluster_sbd_devices SBD devices; one line per device
# TYPE ha_cluster_sbd_devices gauge
ha_cluster_sbd_devices{device="/dev/loop0",status="healthy"} 1
# HELP ha_cluster_sbd_timeouts SBD timeouts for each device and type
# TYPE ha_cluster_sbd_timeouts gauge
ha_cluster_sbd_timeouts{device="/dev/loop0",type="msgwait"} 10
ha_cluster_sbd_timeouts{device="/dev/loop0",type="watchdog"} 5
# HELP ha_cluster_scrape_duration_seconds Duration of a collector scrape.
# TYPE ha_cluster_scrape_duration_seconds gauge
ha_cluster_scrape_duration_seconds{collector="sbd"} 0.027970625
# HELP ha_cluster_scrape_success Whether a collector succeeded.
# TYPE ha_cluster_scrape_success gauge
ha_cluster_scrape_success{collector="sbd"} 1
# HELP process_cpu_seconds_total Total user and system CPU time spent in
seconds.
# TYPE process_cpu_seconds_total counter
process_cpu_seconds_total 0
# HELP process_max_fds Maximum number of open file descriptors.
# TYPE process_max_fds gauge
process_max_fds 524287
# HELP process_network_receive_bytes_total Number of bytes received by the
process over the network.
# TYPE process_network_receive_bytes_total counter
process_network_receive_bytes_total 1.1490227e+07
# HELP process_network_transmit_bytes_total Number of bytes sent by the process
over the network.
# TYPE process_network_transmit_bytes_total counter
process_network_transmit_bytes_total 67386
# HELP process_open_fds Number of open file descriptors.
# TYPE process_open_fds gauge
process_open_fds 7
# HELP process_resident_memory_bytes Resident memory size in bytes.
# TYPE process_resident_memory_bytes gauge
process_resident_memory_bytes 1.4647296e+07
# HELP process_start_time_seconds Start time of the process since unix epoch in
seconds.
# TYPE process_start_time_seconds gauge
process_start_time_seconds 1.75569731143e+09
# HELP process_virtual_memory_bytes Virtual memory size in bytes.
# TYPE process_virtual_memory_bytes gauge
process_virtual_memory_bytes 1.303298048e+09
# HELP process_virtual_memory_max_bytes Maximum amount of virtual memory
available in bytes.
# TYPE process_virtual_memory_max_bytes gauge
process_virtual_memory_max_bytes 1.8446744073709552e+19
# HELP promhttp_metric_handler_requests_in_flight Current number of scrapes
being served.
# TYPE promhttp_metric_handler_requests_in_flight gauge
promhttp_metric_handler_requests_in_flight 1
# HELP promhttp_metric_handler_requests_total Total number of scrapes by HTTP
status code.
# TYPE promhttp_metric_handler_requests_total counter
promhttp_metric_handler_requests_total{code="200"} 0
promhttp_metric_handler_requests_total{code="500"} 0
promhttp_metric_handler_requests_total{code="503"} 0
ha_cluster_sbd_devices{device="/dev/loop0",status="healthy"} 1
ha_cluster_sbd_timeouts{device="/dev/loop0",type="msgwait"} 10
ha_cluster_sbd_timeouts{device="/dev/loop0",type="watchdog"} 5
**********************************************
0
**********************************************
we're in cleanup! Last command exited: 0
removing loop!
0
removing disk!
0
removing export!
0
+ cat /tmp/autopkgtest.SPvFUS/autopkgtest_tmp/export
+ grep /dev/loop0 /tmp/autopkgtest.SPvFUS/autopkgtest_tmp/export
+ retcode=0
+ echo **********************************************
+ echo 0
+ echo **********************************************
+ cleanup
+ echo we're in cleanup! Last command exited: 0
+ echo removing loop!
+ test /dev/loop0
+ losetup -d /dev/loop0
+ echo 0
+ echo removing disk!
+ test -f /tmp/autopkgtest.SPvFUS/autopkgtest_tmp/disk
+ rm -f /tmp/autopkgtest.SPvFUS/autopkgtest_tmp/disk
+ echo 0
+ echo removing export!
+ test -f /tmp/autopkgtest.SPvFUS/autopkgtest_tmp/export
+ rm -f /tmp/autopkgtest.SPvFUS/autopkgtest_tmp/export
+ echo 0
autopkgtest [14:41:52]: test sbd: -----------------------]
autopkgtest [14:41:53]: test sbd: - - - - - - - - - - results - - - - - - - -
- -
sbd FAIL stderr: +
DISK=/tmp/autopkgtest.SPvFUS/autopkgtest_tmp/disk
autopkgtest [14:41:53]: test sbd: - - - - - - - - - - stderr - - - - - - - - -
-
+ DISK=/tmp/autopkgtest.SPvFUS/autopkgtest_tmp/disk
+ echo /tmp/autopkgtest.SPvFUS/autopkgtest_tmp/disk
+ EXPORT=/tmp/autopkgtest.SPvFUS/autopkgtest_tmp/export
+ trap cleanup 0 2 3 15
+ printf \n=== losetup ===\n
+ dd if=/dev/zero of=/tmp/autopkgtest.SPvFUS/autopkgtest_tmp/disk bs=1M count=10
+ losetup --find --show /tmp/autopkgtest.SPvFUS/autopkgtest_tmp/disk
+ LOOP=/dev/loop0
+ printf \n=== create ===\n
+ sbd -d /dev/loop0 create
+ printf \n=== allocate ===\n
+ hostname
+ sbd -d /dev/loop0 allocate autopkgtest
+ printf \n=== list ===\n
+ sbd -d /dev/loop0 list
+ printf \n=== watch ===\n
+ mkdir -p /var/lib/pacemaker/cores
+ sbd -d /dev/loop0 -W -W watch
+ pgrep -a sbd
+ printf \n=== dump ===\n
+ sbd -d /dev/loop0 dump
+ printf \n=== exporter ===\n
+ echo SBD_DEVICE=/dev/loop0
+ service ha_cluster_exporter restart
+ service ha_cluster_exporter status
+ echo 0
+ seq 1 10
+ ss state listening sport = :9664
+ grep -q ^tcp
+ break
+ printf \n=== export ===\n
+ curl --silent --show-error http://localhost:9664/metrics
+ cat /tmp/autopkgtest.SPvFUS/autopkgtest_tmp/export
+ grep /dev/loop0 /tmp/autopkgtest.SPvFUS/autopkgtest_tmp/export
+ retcode=0
+ echo **********************************************
+ echo 0
+ echo **********************************************
+ cleanup
+ echo we're in cleanup! Last command exited: 0
+ echo removing loop!
+ test /dev/loop0
+ losetup -d /dev/loop0
+ echo 0
+ echo removing disk!
+ test -f /tmp/autopkgtest.SPvFUS/autopkgtest_tmp/disk
+ rm -f /tmp/autopkgtest.SPvFUS/autopkgtest_tmp/disk
+ echo 0
+ echo removing export!
+ test -f /tmp/autopkgtest.SPvFUS/autopkgtest_tmp/export
+ rm -f /tmp/autopkgtest.SPvFUS/autopkgtest_tmp/export
+ echo 0
autopkgtest [14:41:54]: @@@@@@@@@@@@@@@@@@@@ summary
```
** Affects: prometheus-hacluster-exporter (Ubuntu)
Importance: Undecided
Status: New
** Tags: update-excuse
** Tags added: update-excuse
--
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/2121079
Title:
sbd autopkgtest inexplicably/quietly fails with glibc/2.42-0ubuntu1+
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/prometheus-hacluster-exporter/+bug/2121079/+subscriptions
--
ubuntu-bugs mailing list
[email protected]
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs