It looks like you are hitting a socket timeout. Try setting "Heartbeat
Interval" in the Director config so that the channel is kept open, and
see if that helps.
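
Something along these lines is what I have in mind. This is only a
sketch: the Director name is taken from your log output, the 60 second
value is an arbitrary starting point rather than a recommendation, and
I am assuming your version accepts the directive in the Director
resource of bacula-dir.conf.

```
# bacula-dir.conf (assumed location of the Director resource)
Director {
  Name = archive2-dir
  # ... keep your existing directives here unchanged ...

  # Illustrative value only; tune it to sit below whatever idle
  # timeout is dropping the connection.
  Heartbeat Interval = 60 seconds
}
```

The File daemon and Storage daemon configs take a similarly named
directive, so if the Director setting alone does not help it may be
worth trying it there as well. Restart or reload the daemon after
changing the config.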
Jason
Michael Proto wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
> Hello all,
>
> I'm seeing a strange problem with my bacula-fd clients after upgrading
> all of my systems to v2.0.3 (client and server). Intermittently, when
> performing a backup of some random client I'll see the following error
> in the Director:
>
> 30-Mar 02:09 archive2-dir: Start Backup JobId 1046,
> Job=guildenstern-a.2007-03-30_01.05.43
> 30-Mar 02:09 archive2-dir: guildenstern-a.2007-03-30_01.05.43 Fatal
> error: Socket error on Storage command: ERR=No data available
> 30-Mar 02:09 archive2-dir: guildenstern-a.2007-03-30_01.05.43 Error:
> Bacula 2.0.3 (06Mar07): 30-Mar-2007 02:09:12
> JobId: 1046
> Job: guildenstern-a.2007-03-30_01.05.43
> Backup Level: Incremental, since=2007-03-29 02:06:06
> Client: "guildenstern-a-fd" 2.0.3 (06Mar07)
> i686-pc-linux-gnu,debian,3.1
> FileSet: "guildenstern" 2007-03-18 21:37:31
> Pool: "Daily" (From Run pool override)
> Storage: "ADIC-Library1" (From Job resource)
> Scheduled time: 30-Mar-2007 01:05:42
> Start time: 30-Mar-2007 02:09:05
> End time: 30-Mar-2007 02:09:12
> Elapsed time: 7 secs
> Priority: 10
> FD Files Written: 0
> SD Files Written: 0
> FD Bytes Written: 0 (0 B)
> SD Bytes Written: 0 (0 B)
> Rate: 0.0 KB/s
> Software Compression: None
> VSS: no
> Encryption: no
> Volume name(s):
> Volume Session Id: 44
> Volume Session Time: 1175201849
> Last Volume Bytes: 204,618,000,384 (204.6 GB)
> Non-fatal FD errors: 0
> SD Errors: 0
> FD termination status:
> SD termination status: Error
> Termination: *** Backup Error ***
>
> If I re-run the job just after the failure, the client works as
> expected. I have about 80 clients, all different platforms (Linux,
> FreeBSD, and Windows), and this seems to only affect the Linux clients.
> Of those Linux clients that are failing it occurs on a variety of
> distributions/versions (Debian v3.0 & v3.1, RHEL v3 & v4) and it's
> hit-or-miss whether a given Linux client will work on the first try or
> not, but in all cases I've seen (thus far), the re-run job works fine.
> Some days, a given client will work on the first try, and then the next
> day it fails, then works again the following day, etc... I haven't
> determined any rhyme or reason to it other than it's just Linux clients
> that are
> affected. Currently, about 30% of my clients on a given day exhibit this
> behavior.
>
> To work around the problem I've added the following entries to the
> default job resource:
>
> JobDefs {
> Name = "DefaultJob"
> Type = Backup
> Reschedule On Error = yes
> Reschedule Times = 3
> Reschedule Interval = 90 seconds
> ...
>
> This does help my regularly-scheduled jobs to complete without having to
> manually re-run them, but this is not ideal and I'd like to determine
> why the first backup of a given client is failing.
>
> I built and packaged all the Bacula Linux clients myself (so they all
> pull from the same set of config files for quick installation), and I
> used the following compile-time flags when building them:
>
> --with-openssl --enable-client-only --enable-static-fd --enable-smartalloc
>
> I'm using the static-bacula-fd binary (instead of the bacula-fd binary)
> for maximum portability. They were built on a Debian Sarge host and then
> packaged into appropriate distribution packages.
>
> On one of the often-affected hosts I now have the client started with
> the following flags (out of /etc/inittab):
>
> /sbin/static-bacula-fd -fvc -d100 /etc/bacula/bacula-fd.conf > /tmp/bacula-fd.out
>
> When the client fails, I see the modification timestamp update on the
> resultant /tmp/bacula-fd.out file, but it's currently empty. Do I need to
> redirect stderr to this file instead of stdout?
>
> Anyone have any ideas what might be causing these errors or how I can go
> about debugging this unusual (and while not critical, still very
> annoying) problem?
>
>
>
> Thanks!
> Michael Proto
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v1.4.7 (FreeBSD)
>
> iD8DBQFGEX3TOLq/wl1XW74RAmOeAJ9U9+O6kNDDp3LBVGyBHvD7Lt+JvgCdFsrI
> f8IzD/gUPS0/F4dGgeIZ7J4=
> =NcOC
> -----END PGP SIGNATURE-----
>
> -------------------------------------------------------------------------
> Take Surveys. Earn Cash. Influence the Future of IT
> Join SourceForge.net's Techsay panel and you'll get the chance to share your
> opinions on IT & business topics through brief surveys-and earn cash
> http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
> _______________________________________________
> Bacula-users mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/bacula-users
>