On Tue, Aug 09, 2016 at 08:39:15AM +1000, Lachlan Musicman wrote: > We are seeing SSSD in a failed state at random intervals. > > Using the 1.14.0 COPR repo on Centos 7, FreeIPA 4.2 > > Unfortunately it's not something we want to reproduce and I'd turned the > debug logs off because of their size. I'm turning them back on one by one > as the crashes happen. > > The only thing we see in the logs when it happens is: > > > (Mon Aug 8 09:39:44 2016) [sssd] [watchdog_handler] (0x0010): Watchdog > timer overflow, killing process! > (Mon Aug 8 09:39:44 2016) [sssd] [orderly_shutdown] (0x0010): SIGTERM: > killing children
This means the sssd process was 'stuck' for some time so that the watchdog killed it. Getting a pstack of that process might be valuable. > > > > Any ideas on what might cause this? > > > Cheers > L. > ------ > The most dangerous phrase in the language is, "We've always done it this > way." > > - Grace Hopper > -- > Manage your subscription for the Freeipa-users mailing list: > https://www.redhat.com/mailman/listinfo/freeipa-users > Go to http://freeipa.org for more info on the project -- Manage your subscription for the Freeipa-users mailing list: https://www.redhat.com/mailman/listinfo/freeipa-users Go to http://freeipa.org for more info on the project
