Hi,
I added "Client Run Before/After Job" scripts to two backup jobs in
order to stop/start mysql. But the "After" script seems to hang
bacula-fd. In the bacula console, 'status dir' reports these jobs as
"has terminated", while 'list jobs' still shows their status as 'R'
(running).
The script being called is the FreeBSD rc script used to start/stop
mysql: /usr/local/etc/rc.d/mysql-server [start|stop]. I've ensured that
this script works when run manually, and that it returns 0.
When I restart the hung bacula-fd processes (a plain kill doesn't work;
it takes kill -9), the offending jobs finish with errors and the rest of
my queued jobs begin to run.
Any ideas what's causing this? Suggestions for fixing it?
Here are some details:
Director OS: FreeBSD 6.2-RELEASE
File Daemon OS: FreeBSD 6.1-RELEASE
bacula-dir/sd: 2.2.4
bacula-fd: 2.2.4
catalog: MySQL 5.0
Total # of jobs: 13, of which...
  7 with "Priority = 10"
  5 with "Priority = 12"
  1 with "Priority = 20"
bacula-dir.conf: Director{Maximum Concurrent Jobs=10;...}
bacula-sd.conf: Storage{Maximum Concurrent Jobs=20;...}
Offending Client Run Before/After Job scripts:
Client Run Before Job = "/usr/local/etc/rc.d/mysql-server stop"
Client Run After Job = "/usr/local/etc/rc.d/mysql-server start"
logged output on the director while job is hung:
zeus-dir: sql_find.c:134 No Job record found: ERR=
CMD=SELECT StartTime FROM Job WHERE JobStatus='T' AND Type='B' AND Level='F' AND Name='ritin' AND ClientId=5 AND FileSetId=5 ORDER BY StartTime DESC LIMIT 1
zeus-dir: No prior or suitable Full backup found in catalog. Doing FULL backup.
zeus-dir: Start Backup JobId 31, Job=ritin.2007-10-30_05.05.04
zeus-dir: Created new Volume "ritin-Full-0002" in catalog.
zeus-dir: Using Device "ritinFileStorage"
ritin-fd: ClientRunBeforeJob: run command "/usr/local/etc/rc.d/mysql-server stop"
ritin-fd: ClientRunBeforeJob: Stopping mysql.
ritin-fd: ClientRunBeforeJob: Waiting for PIDS: 96261.
zeus-sd: Labeled new Volume "ritin-Full-0002" on device "ritinFileStorage" (/bacula/disk2).
zeus-sd: Wrote label to prelabeled Volume "ritin-Full-0002" on device "ritinFileStorage" (/bacula/disk2)
zeus-dir: Max Volume jobs exceeded. Marking Volume "ritin-Full-0002" as Used.
ritin-fd: Disallowed filesystem. Will not descend from / into /dev
zeus-sd: Job write elapsed time = 00:57:03, Transfer rate = 9.499 M bytes/second
ritin-fd: ClientAfterJob: run command "/usr/local/etc/rc.d/mysql-server start"
ritin-fd: ClientAfterJob: Starting mysql.
'status dir' output while job on 'ritin' is hung (truncated):
Running Jobs:
 JobId Level   Name                        Status
======================================================================
    31 Full    ritin.2007-10-30_05.05.04   has terminated
    34 Full    comdev.2007-10-30_05.05.07  is waiting for higher priority jobs to finish
    35 Full    comstag.2007-10-30_05.05.08 is waiting execution
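
One thing that could be tested here, as a sketch only: the log stops right after "Starting mysql.", which would be consistent with (but does not prove) the daemon started by the rc script inheriting the output pipe that bacula-fd reads from, so that bacula-fd never sees end-of-file. Under that assumption, a small wrapper that detaches stdin, stdout and stderr before running the rc script might avoid the hang. The wrapper and the function name run_detached are hypothetical, not part of Bacula or FreeBSD.

```sh
#!/bin/sh
# Hypothetical wrapper (name and location are assumptions, not Bacula files).
# Runs the given command with stdin, stdout and stderr pointed at /dev/null,
# so that any daemon the command leaves behind cannot hold open the pipe
# back to the calling process.

run_detached() {
    # "$@" is the command plus its arguments, e.g. the rc script and "start".
    # The function returns the exit status of that command.
    "$@" </dev/null >/dev/null 2>&1
}

# Only act when called with a command; does nothing when given no arguments.
if [ "$#" -gt 0 ]; then
    run_detached "$@"
    exit $?
fi
```

If this were saved as, say, /usr/local/sbin/run-detached.sh (a made-up path), the job could call it with the rc script and "start" as arguments in place of the current Client Run After Job command. The trade-off is that the script's own output ("Starting mysql.") would no longer appear in the job log; only its exit status is passed back.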
Dane
--
Dane Miller
Systems Administrator
Great Schools, Inc
http://greatschools.net