Package: mod-gearman-tools
Version: 1.2.4-1
Severity: normal
File: /usr/lib/nagios/plugins/check_gearman

Hello there,

/usr/lib/nagios/plugins/check_gearman gives a hint on how to monitor a
single worker:

$ /usr/lib/nagios/plugins/check_gearman | tail -n 5
| Check worker:
| 
| %> ./check_gearman -H <job server hostname> -q worker_<worker hostname> -t 10 -s check
| check_gearman OK - host has 5 worker and is working on 0 jobs|worker=5 jobs=96132c
| 

This check previously worked fine in my local Icinga setup. The last
good check output was:
| check_gearman OK - fernst has 3 worker and is working on 0 jobs. Version: 1.2.0

Then I upgraded mod-gearman, first from 1.2.0-1 to 1.2.2-2 and then
to 1.2.4-1. Ever since the first upgrade, the check has failed:
$ /usr/lib/nagios/plugins/check_gearman -H 127.0.0.1 -q worker_fernst -s check
| check_gearman CRITICAL - Job failed: _client_do(GEARMAN_TIMEOUT) occured during gearman_client_run_tasks() -> libgearman/client.cc:167

However, the queue is accessible:
$ /usr/lib/nagios/plugins/check_gearman -H 127.0.0.1 -q worker_fernst 
| check_gearman OK - 0 jobs running and 1 job waiting. Version: 0.28|'worker_fernst_waiting'=1;10;100;0 'worker_fernst_running'=0 'worker_fernst_worker'=1;25;50;0
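
In case it helps anyone compare values across versions, here is a minimal sketch for pulling the performance data out of a check output line like the one above. It assumes the usual Nagios plugin convention of label=value;warn;crit;min;max after the "|" separator; the function name parse_perfdata is made up for this example and is not part of mod-gearman.

```python
import shlex


def parse_perfdata(plugin_output):
    """Parse the perfdata part (after '|') of a single-line plugin output.

    Assumes the common Nagios convention label=value;warn;crit;min;max,
    where the label may be wrapped in single quotes.
    """
    _, sep, perf = plugin_output.partition("|")
    if not sep:
        return {}  # no perfdata section at all
    metrics = {}
    # shlex.split strips the single quotes around labels and splits on spaces.
    for token in shlex.split(perf):
        label, eq, data = token.rpartition("=")
        if not eq:
            continue  # not a label=value item, skip it
        fields = data.split(";")
        fields += [""] * (5 - len(fields))  # pad missing trailing fields
        value, warn, crit, minimum, maximum = fields[:5]
        metrics[label] = {
            "value": value,
            "warn": warn,
            "crit": crit,
            "min": minimum,
            "max": maximum,
        }
    return metrics
```

Fed the queue check output from above, this should report worker_fernst_waiting with value 1, warn 10 and crit 100, assuming the thresholds really follow that field order.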

Here is the corresponding output from gearman_top:
| 2012-03-13 13:54:41  -  localhost:4730   -  v0.28
| 
|  Queue Name    | Worker Available | Jobs Waiting | Jobs Running
| ----------------------------------------------------------------
|  check_results |               1  |           0  |           0
|  host          |               1  |           0  |           0
|  perfdata      |               1  |           0  |           0
|  service       |               1  |           0  |           0
|  worker_fernst |               1  |           1  |           0
| ----------------------------------------------------------------
| 

Note: "Jobs Waiting" is 0 directly after restarting mod-gearman-worker.
It changes to 1 the first time the failing check from above runs, and
stays at 1 from then on.
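
To watch the stuck job without gearman_top, the same counters can be read straight from the job server. The sketch below assumes gearmand's plain-text admin command "status", which answers with one tab-separated line per queue (name, total jobs, running jobs, available workers) and ends with a line containing only ".". It also assumes that "Jobs Waiting" corresponds to total minus running. The function names are made up for this example.

```python
import socket


def parse_status(text):
    """Turn the reply to gearmand's 'status' admin command into a dict."""
    rows = {}
    for line in text.splitlines():
        if line == ".":
            break  # a lone dot terminates the reply
        parts = line.split("\t")
        if len(parts) != 4:
            continue  # skip anything that is not a queue line
        name = parts[0]
        total, running, workers = (int(p) for p in parts[1:])
        rows[name] = {
            "workers": workers,
            "waiting": total - running,  # assumption: total includes running jobs
            "running": running,
        }
    return rows


def fetch_status(host="127.0.0.1", port=4730, timeout=5.0):
    """Ask a gearmand job server for its queue status and parse the reply."""
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.sendall(b"status\n")
        data = b""
        # Read until the terminating ".\n" line arrives or the server closes.
        while not (data == b".\n" or data.endswith(b"\n.\n")):
            chunk = sock.recv(4096)
            if not chunk:
                break
            data += chunk
    return parse_status(data.decode("ascii", "replace"))
```

Calling fetch_status() before and after the failing check should show whether the "waiting" counter of worker_fernst goes from 0 to 1, as described above.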

To me this looks like a bug: either the sample check printed by
check_gearman is wrong, or the check broke with the recent updates.
Please advise.

If you need any further info please don't hesitate to ask.

Cheers,
Flo

-- System Information:
Debian Release: wheezy/sid
  APT prefers testing
  APT policy: (990, 'testing'), (50, 'unstable'), (1, 'experimental')
Architecture: amd64 (x86_64)

Kernel: Linux 3.2.0-1-amd64 (SMP w/4 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages mod-gearman-tools depends on:
ii  libc6        2.13-27
ii  libgearman6  0.28-1
ii  libncurses5  5.9-4
ii  libtinfo5    5.9-4

mod-gearman-tools recommends no packages.

mod-gearman-tools suggests no packages.

-- no debconf information


