THanks Bernard, it could be a spurious operating system thing. Good luck
M bernardchan wrote: > > Hi Mark, > > Yes, I have checked from SVN that the code was not changed for many > revisions, so I also suspected there might be something specific to > those installations and our cfengine configuration, but what is strange > is that I had not experienced similar crashes with our machines with > identical configuration (and software version, by the way), > because all the disks were dd'ed from the same source. I raised it > because I was unsure whether there might be some behaviour I don't > know of that may lead to those crashes. In Case 1 the SIGABRT occurred > while recursive copying a dir to another dir with mixed regular files > and symbolic links. In Case 2 the > SIGSEGV occurred while processing the "processes" section and apparently > checking the ps auxw. > For instance, whether the locale or charset or special filename pattern > may have anything to do with it. > > Yes, they were repeatable on many of our systems only recently, but not some > time before. So probably some of our recent file changes may have triggered > that. I recalled seeing some spurious SIGSEGV on a few other systems but > were not always repeatable there. > > I have briefly compiled a version of 2.2.2 and put it in some temporary > directory on some of our systems with crashes seen with 2.2.1. Up to now > a few runs have been through without crashing, but considering it has > not sustained enough real usage, > At this point I cannot tell for sure whether those problems may pop up > again once I replaced the > systemwide version ( 2.2.1 ) with 2.2.2 and some file changes made which > require copying and > restarting processes. > > Will try to migrate more machines to 2.2.2 and check whether the issue > would go away. > > Thank you again for your reply. > > Regards, > Bernard Chan. > > On Sun, 07 Oct 2007 10:25:41 +0200, Mark Burgess wrote >> Hi Bernard, >> >> thanks for this information. This is a little unusual. In fact this >> is not a SEG fault but an abort signal, which is software generated. >> It comes from file operations, which is code that has not changed >> for several years. This makes me suspect that there could be some >> site-specific reason for this. >> >> Does this happen regularly/repeatably? On the same host, or different >> ones? Would you be willing to try compiling 2.2.2 to see if there are >> any differences? >> >> thanks >> Mark >> >> Bernard Chan wrote: >>> Hello, >>> >>> I experienced various instances of segfaults on some cfengine >>> installations. >>> Following shows the two cases which I encounter so far: >>> >>> Compiler: gcc 3.4.4 >>> Version: cfengine 2.2.1 >>> Linux (Distribubtion: AsteriskNow) >>> >>> CASE 1 >>> >>> (gdb) run -D forceUpdate >>> Starting program: /usr/local/sbin/cfagent -D forceUpdate >>> Detaching after fork from child process 4679. >>> *** glibc detected *** free(): invalid pointer: 0x081772c8 *** >>> >>> Program received signal SIGABRT, Aborted. >>> 0xb7f7f410 in ?? () >>> (gdb) back >>> #0 0xb7f7f410 in ?? () >>> #1 0xbff88560 in ?? () >>> #2 0x00000006 in ?? () >>> #3 0x00001244 in ?? () >>> #4 0xb7c3b275 in raise () from /lib/tls/libc.so.6 >>> #5 0xb7c3ca59 in abort () from /lib/tls/libc.so.6 >>> #6 0xb7c6f19a in __fsetlocking () from /lib/tls/libc.so.6 >>> #7 0xb7c750a7 in malloc_usable_size () from /lib/tls/libc.so.6 >>> #8 0xb7c75abb in free () from /lib/tls/libc.so.6 >>> #9 0xb7c97e08 in closedir () from /lib/tls/libc.so.6 >>> #10 0x0805fe75 in cfclosedir (dirh=0xb7d29e40) at image.c:1086 >>> #11 0x080a162e in RecursiveImage (ip=0x81602f0, >>> from=0xbff92950 "/mnt/asterisksetup", to=0xbff90950 "/etc/asterisk_bak", >>> maxrecurse=-99) at expand-image.c:234 >>> #12 0x08052c25 in MakeImages () at do.c:2548 >>> #13 0x0804de24 in DoTree (passes=3, info=0x80a7afa "Main Tree") >>> at cfagent.c:1328 >>> #14 0x0804ea5f in main (argc=3, argv=0xbff94aa4) at cfagent.c:180 >>> >>> CASE 2 >>> >>> >>> (gdb) run -q -D forceUpdate >>> Starting program: /usr/local/sbin/cfagent -q -D forceUpdate >>> Detaching after fork from child process 6206. >>> >>> Detaching after fork from child process 6207. >>> Detaching after fork from child process 6208. >>> Detaching after fork from child process 6209. >>> Detaching after fork from child process 6210. >>> Detaching after fork from child process 6211. >>> >>> Program received signal SIGSEGV, Segmentation fault. >>> 0xb7c0efee in free () from /lib/tls/libc.so.6 >>> (gdb) back >>> #0 0xb7c0efee in free () from /lib/tls/libc.so.6 >>> #1 0xb7c10701 in malloc () from /lib/tls/libc.so.6 >>> #2 0x0806237d in AppendItem (liststart=0xbfd1b608, >>> itemstring=0x816d130 "root 362 0.0 0.0 0 0 ? >>> S< 10:05 0:00 [cifsoplockd]", classes=0x8199b18 "") at item.c:349 >>> #3 0x080624fd in CopyList (dest=0xbfd1b608, source=0x8193030) at item.c:210 >>> #4 0x0805d513 in LoadProcessTable (procdata=0xbfd1b748, >>> psopts=0x80b15c1 "auxw") at process.c:78 >>> #5 0x0805302d in CheckProcesses () at do.c:2678 >>> #6 0x0804ddc9 in DoTree (passes=3, info=0x80a7afa "Main Tree") >>> at cfagent.c:1348 >>> #7 0x0804ea5f in main (argc=4, argv=0xbfd1b834) at cfagent.c:180 >>> >>> >>> Thanks for creating the cfengine >>> >>> Regards, >>> Bernard Chan. >>> >>> _______________________________________________ >>> Bug-cfengine mailing list >>> [email protected] >>> https://cfengine.org/mailman/listinfo/bug-cfengine >> -- >> Mark Burgess >> >> Professor of Network and System Administration >> Oslo University College >> >> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Work: > +47 22453272 Email: [EMAIL PROTECTED] Fax : +47 22453205 > WWW : http://www.iu.hio.no/~mark > ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ > _______________________________________________ Bug-cfengine mailing list > [email protected] https://cfengine.org/mailman/listinfo/bug-cfengine > > > -- > PowerAll Networks Ltd (http://www.powerallnetworks.com) > -- Mark Burgess Professor of Network and System Administration Oslo University College ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Work: +47 22453272 Email: [EMAIL PROTECTED] Fax : +47 22453205 WWW : http://www.iu.hio.no/~mark ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ _______________________________________________ Bug-cfengine mailing list [email protected] https://cfengine.org/mailman/listinfo/bug-cfengine
