Sorry. I forgot to mention MySQL 4. Its still responding. I've tested it
while the jobs were hung. Also, if I cancel the hung job, the next tape job
in queue starts and completes just fine.
-Shon
On Wed, Feb 18, 2009 at 4:13 PM, Ryan Novosielski <[email protected]>wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
> Mingus Dew wrote:
> > Hi all,
> > Been using Bacula 2.4.2 on Solaris 10_x86 for almost 2 years now.
> > Recently tape backups have been entering into a state that I can only
> > describe as "limbo".
> >
> > If I check the status of the director, I may see something like
> >
> > Running Jobs:
> > JobId Level Name Status
> > ======================================================================
> > 22649 Increme RMAN_A_Lvl1_Tape.2009-02-17_13.30.36 is running
> > 22650 Increme RMAN_B_Lvl1_Tape.2009-02-17_13.30.38 is waiting on max
> > Storage jobs
> > 22651 Increme RMAN_PROD_Lvl1_Tape.2009-02-17_14.00.40 is waiting on
> > max Storage jobs
> > 22652 Increme RMAN_BI_Lvl1_Tape.2009-02-17_14.00.42 is waiting on max
> > Storage jobs
> > 22653 Increme RMAN_COG_Lvl1_Tape.2009-02-17_14.00.44 is waiting on max
> > Storage jobs
> >
> > If I check the status of the running jobid or the tape device, it will
> > show this:
> >
> > Used Volume status:
> > B00046 on device "Ultrium-TD3" (/dev/rmt/0cbn)
> > Reader=0 writers=0 devres=0 volinuse=1
> > ====
> >
> > Data spooling: 0 active jobs, 0 bytes; 80 total jobs, 47,799,329,608 max
> > bytes/job.
> > Attr spooling: 0 active jobs, 0 bytes; 80 total jobs, 40,616 max bytes.
> >
> > Basically, tape is mounted and reserved, job is showing a "is running"
> > status, but nothing is happening. Because I lack any monitoring of how
> > long jobs have been running,
> > these have sat for as many as 3 days without changing status, erroring,
> > or completing. This backs up subsequent jobs that have been waiting for
> > the tape device.
> > The only commonality that I've seen is that they are tape jobs. Other
> > than that, the level, fileset, etc. are different.
> >
> > On one occasion when I cancelled one of these long running jobs, I got
> > an error
> >
> > Hostname : BUG!
> > Date : 2009-02-11 14:00:30
> > Severity : err
> >
> > unregister_watchdog_unlocked called before start_watchdog
> >
> >
> > Hostname : BUG!
> > Date : 2009-02-11 14:00:30
> > Severity : err
> >
> > bacula-dir[20200]: [ID 702911 daemon.error] backup4.director: ABORTING
> > due to ERROR in watchdog.c:206
> >
> > If anyone has any advice on what might be happening, I would really
> > appreciate your responses.
>
> Check to see what, if anything, your backend database is doing. You
> don't tell us what it is, so I can't be any more specific.
>
> - --
> ---- _ _ _ _ ___ _ _ _
> |Y#| | | |\/| | \ |\ | | |Ryan Novosielski - Systems Programmer II
> |$&| |__| | | |__/ | \| _| |[email protected] - 973/972.0922 (2-0922)
> \__/ Univ. of Med. and Dent.|IST/CST - NJMS Medical Science Bldg - C630
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v1.4.9 (GNU/Linux)
> Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
>
> iEYEARECAAYFAkmcee8ACgkQmb+gadEcsb7DFwCgsSkpcfe1yenkadAjrZwH0nhf
> hVcAoNI3/Xjl7F59nl/uIEQE5/qDQfmx
> =l5KJ
> -----END PGP SIGNATURE-----
>
>
> ------------------------------------------------------------------------------
> Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco,
> CA
> -OSBC tackles the biggest issue in open source: Open Sourcing the
> Enterprise
> -Strategies to boost innovation and cut costs with open source
> participation
> -Receive a $600 discount off the registration fee with the source code:
> SFAD
> http://p.sf.net/sfu/XcvMzF8H
> _______________________________________________
> Bacula-users mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/bacula-users
>
>
------------------------------------------------------------------------------
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
_______________________________________________
Bacula-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/bacula-users