Jeremy Howard wrote:
>
> Lawrence Greenfield wrote:
>
> > Date: Wed, 15 May 2002 16:02:42 -0300
> > From: Henrique de Moraes Holschuh <[EMAIL PROTECTED]>
> >[...]
> > The point is, if that indeed happens, log or no log, master loses track of
> > the number of children that can service requests. That would be a bug, and
> > the patch supposedly fixes this bug. It really doesn't matter (for
> > accepting or not the patch) why the child died.
> >
> >Yes, I understand that. However, if the master (in real life
> >situations) is actually losing track of the number of available
> >service processes without one of those service processes crashing
> >(either by the sysadmin or otherwise) then there's some other problem
> >in the child accounting.
> >
> >
> The child accounting is fine. The problem in our case was always caused
> by child segfaults, or failure to properly close TCP connections. We
> still see segfaults (about one per fifty thousand connections I'd
> guess),
Can you send us a backtrace from a core? If you're not getting a core,
please setup your system to dump one. Here are bits that I use in my
Cyrus startup script on Linux:
cd /var/imap/cores
ulimit -c unlimited
export MALLOC_CHECK_=2
$master &
If you have multiple services/processes the cores will overwrite each
other, so you need to catch it fairly quickly (unless they all have the
same failure).
Ken
--
Kenneth Murchison Oceana Matrix Ltd.
Software Engineer 21 Princeton Place
716-662-8973 x26 Orchard Park, NY 14127
--PGP Public Key-- http://www.oceana.com/~ken/ksm.pgp