Package: zabbix
Version: 1:1.1-5

I'm using zabbix backported to Sarge (made by myself).
I found both agent and server sometimes fail to restart during log rotation, 
because tcp port they listen is still in use (in TIME_WAIT state). It 
happen about one on ten restarts of agnet or daemon (when restarted 
mannually).

Output from zabbix_agent.log:
Cannot bind to port 10050. Error [Address already in use]. Another 
zabbix_agentd already running ?

netstat shows:
netstat -an | grep 10050
tcp        0      0 192.168.101.20:10050    192.168.101.20:32848    
TIME_WAIT

I found this on Zabbix Forum:
http://www.zabbix.com/forum/printthread.php?t=1826

Problem could be fixed by adding this line after stopping the zabbix_agent 
in init file:
while [ `netstat -an | grep 10050 | wc -l` -ne 0 ]; do :; done

and this line after stopping zabbix_server:
while [ `netstat -an | grep 10051 | wc -l` -ne 0 ]; do :; done

assuming, that agent and server listen on default port.

I suggest adding some kind of detection if tcp port, agent or server is 
using
a) before agent or server start
or
b) after agent or server is stopped

My system:
Debian Sarge
Linux monitor.prz.edu.pl 2.6.16-2-vserver-686 #2 SMP Tue May 23 07:55:55 
CEST 2006 i686 GNU/Linux

-- 
Krzysztof Raczkowski
Rzeszow University of Technology
System Administrator of CZ RMSK
tel.: (017) 865-13-93; e-mail: [EMAIL PROTECTED]


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]

Reply via email to