Package: bind9 Version: 1:9.8.1.dfsg.P1-4.2 Severity: grave Justification: renders package unusable
Hi. Since some update (unfortunately I forgot which one,.. but it's at least months ago) I experiece a memory hole in named. I don't see a certain way to reproduce it but it happens every week (or even more often). dmesg shows the typical: [651271.745183] named invoked oom-killer: gfp_mask=0x201da, order=0, oom_adj=0, oom_score_adj=0 [651271.745488] named cpuset=/ mems_allowed=0 [651271.745612] Pid: 3826, comm: named Not tainted 3.2.0-3-amd64 #1 [651271.745782] Call Trace: [651271.745862] [<ffffffff810b640a>] ? dump_header+0x78/0x1bd [651271.746037] [<ffffffff8116303e>] ? security_real_capable_noaudit+0x40/0x4f [651271.746269] [<ffffffff810b67d3>] ? oom_kill_process+0x49/0x271 [651271.746462] [<ffffffff810b6ece>] ? out_of_memory+0x2ea/0x337 [651271.746635] [<ffffffff810bac76>] ? __alloc_pages_nodemask+0x5e4/0x726 [651271.746822] [<ffffffff810e4094>] ? alloc_pages_current+0xb2/0xcc [651271.747001] [<ffffffff810b3c1e>] ? find_get_page+0x1a/0x63 [651271.747193] [<ffffffff810b5b11>] ? filemap_fault+0x24f/0x33e [651271.747388] [<ffffffff810cdc24>] ? __do_fault+0xc8/0x3ac [651271.747599] [<ffffffff810d019f>] ? handle_pte_fault+0x298/0x79f [651271.747791] [<ffffffff810cd6e9>] ? set_pmd+0x5/0x8 [651271.748074] [<ffffffff810ce7a5>] ? __pte_alloc+0x87/0x11e [651271.748255] [<ffffffff810cd779>] ? pte_offset_kernel+0x16/0x35 [651271.748461] [<ffffffff8134dc83>] ? do_page_fault+0x312/0x337 [651271.748634] [<ffffffff81037ec0>] ? set_next_entity+0x32/0x55 [651271.748804] [<ffffffff8100d755>] ? __switch_to+0x1e5/0x258 [651271.748967] [<ffffffff8103987c>] ? finish_task_switch+0x88/0xba [651271.749408] [<ffffffff81349974>] ? __schedule+0x5ac/0x5c3 [651271.749752] [<ffffffff8134b275>] ? page_fault+0x25/0x30 [651271.750136] Mem-Info: [651271.750424] Node 0 DMA per-cpu: [651271.750745] CPU 0: hi: 0, btch: 1 usd: 0 [651271.751101] Node 0 DMA32 per-cpu: [651271.751424] CPU 0: hi: 186, btch: 31 usd: 91 [651271.751769] active_anon:85944 inactive_anon:85973 isolated_anon:0 [651271.751770] active_file:9 inactive_file:37 isolated_file:0 [651271.751770] unevictable:0 dirty:0 writeback:0 unstable:0 [651271.751771] free:10340 slab_reclaimable:1378 slab_unreclaimable:2703 [651271.751772] mapped:72 shmem:169 pagetables:2187 bounce:0 [651271.753685] Node 0 DMA free:3740kB min:776kB low:968kB high:1164kB active_anon:5864kB inactive_anon:5912kB active_file:0kB inactive_file:0kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15688kB mlocked:0kB dirty:0kB writeback:0kB mapped:0kB shmem:0kB slab_reclaimable:252kB slab_unreclaimable:88kB kernel_stack:0kB pagetables:52kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:516 all_unreclaimable? yes [651271.756066] lowmem_reserve[]: 0 741 741 741 [651271.756455] Node 0 DMA32 free:37620kB min:37716kB low:47144kB high:56572kB active_anon:337912kB inactive_anon:337980kB active_file:36kB inactive_file:148kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:759508kB mlocked:0kB dirty:0kB writeback:0kB mapped:288kB shmem:676kB slab_reclaimable:5260kB slab_unreclaimable:10724kB kernel_stack:2208kB pagetables:8696kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:2216 all_unreclaimable? yes [651271.758858] lowmem_reserve[]: 0 0 0 0 [651271.759222] Node 0 DMA: 1*4kB 1*8kB 7*16kB 5*32kB 18*64kB 2*128kB 0*256kB 0*512kB 0*1024kB 1*2048kB 0*4096kB = 3740kB [651271.760048] Node 0 DMA32: 399*4kB 323*8kB 328*16kB 215*32kB 99*64kB 33*128kB 16*256kB 1*512kB 0*1024kB 1*2048kB 1*4096kB = 37620kB [651271.760888] 10052 total pagecache pages [651271.761212] 9832 pages in swap cache [651271.761535] Swap cache stats: add 878005, delete 868173, find 2062031/2100592 [651271.762142] Free swap = 0kB [651271.762447] Total swap = 1023996kB [651271.796185] 196589 pages RAM [651271.796572] 4843 pages reserved [651271.796904] 510 pages shared [651271.797223] 180610 pages non-shared [651271.797576] [ pid ] uid tgid total_vm rss cpu oom_adj oom_score_adj name [651271.798264] [ 296] 0 296 5424 0 0 -17 -1000 udevd [651271.798968] [ 2762] 0 2762 4739 35 0 0 0 rpcbind [651271.799686] [ 3494] 0 3494 4764 1 0 0 0 starter [651271.800392] [ 3501] 108 3501 98342 104 0 0 0 charon [651271.801094] [ 3593] 0 3593 13403 34 0 0 0 rsyslogd [651271.801807] [ 3630] 0 3630 980 0 0 0 0 acpi_fakekeyd [651271.802521] [ 3745] 0 3745 1058 1 0 0 0 acpid [651271.803218] [ 3795] 0 3795 4164 0 0 0 0 atd [651271.803930] [ 3825] 109 3825 406347 152163 0 0 0 named [651271.804749] [ 3914] 0 3914 22585 11 0 0 0 apache2 [651271.805484] [ 3917] 33 3917 22361 3 0 0 0 apache2 [651271.806169] [ 3941] 33 3941 78568 92 0 0 0 apache2 [651271.806933] [ 3942] 33 3942 78554 50 0 0 0 apache2 [651271.807642] [ 3943] 33 3943 78580 73 0 0 0 apache2 [651271.808369] [ 4042] 0 4042 5505 22 0 0 0 cron [651271.809100] [ 4073] 103 4073 7485 0 0 0 0 dbus-daemon [651271.809976] [ 4101] 0 4101 15505 0 0 0 0 console-kit-dae [651271.810734] [ 4180] 0 4180 14346 1 0 0 0 polkitd [651271.811422] [ 4209] 111 4209 2705 11 0 0 0 epmd [651271.812141] [ 4263] 111 4263 29833 2103 0 0 0 beam [651271.812858] [ 4367] 0 4367 17778 446 0 0 0 fail2ban-server [651271.813592] [ 4380] 0 4380 5572 13 0 0 0 gam_server [651271.814313] [ 4664] 0 4664 3268 0 0 0 0 mcelog [651271.815006] [ 4693] 104 4693 9745 25 0 0 0 ntpd [651271.815716] [ 4763] 114 4763 20941 77 0 -13 -900 postgres [651271.816434] [ 4838] 114 4838 20937 24 0 0 0 postgres [651271.817150] [ 4839] 114 4839 20937 16 0 0 0 postgres [651271.817865] [ 4840] 114 4840 21153 104 0 0 0 postgres [651271.818564] [ 4841] 114 4841 17381 50 0 0 0 postgres [651271.819255] [ 5390] 0 5390 9442 29 0 0 0 master [651271.828196] [ 5409] 105 5409 10000 38 0 0 0 qmgr [651271.828890] [ 5421] 112 5421 19209 3452 0 0 0 sks [651271.829590] [ 5422] 112 5422 11925 2558 0 0 0 sks [651271.830270] [ 5423] 0 5423 1021 0 0 0 0 startpar [651271.830978] [ 5442] 0 5442 12459 12 0 -17 -1000 sshd [651271.831670] [ 5595] 0 5595 4467 2 0 0 0 getty [651271.832362] [ 5596] 0 5596 4467 2 0 0 0 getty [651271.833089] [ 5597] 0 5597 4467 2 0 0 0 getty [651271.833783] [ 5598] 0 5598 4467 2 0 0 0 getty [651271.834477] [ 5599] 0 5599 4467 2 0 0 0 getty [651271.835151] [ 5600] 0 5600 4467 2 0 0 0 getty [651271.835849] [ 5972] 111 5972 2696 1 0 0 0 inet_gethost [651271.836607] [ 5973] 111 5973 4272 2 0 0 0 inet_gethost [651271.837316] [11641] 33 11641 78566 49 0 0 0 apache2 [651271.838060] [11732] 111 11732 4272 24 0 0 0 inet_gethost [651271.838786] [23714] 105 23714 10588 51 0 0 0 tlsmgr [651271.839483] [14407] 111 14407 4272 2 0 0 0 inet_gethost [651271.840230] [10917] 0 10917 1178 38 0 0 0 haveged [651271.840954] [12587] 105 12587 9959 134 0 0 0 pickup [651271.841647] Out of memory: Kill process 3825 (named) score 860 or sacrifice child [651271.842335] Killed process 3825 (named) total-vm:1625388kB, anon-rss:608652kB, file-rss:0kB [651271.845894] fail2ban-server invoked oom-killer: gfp_mask=0x201da, order=0, oom_adj=0, oom_score_adj=0 [651271.846654] fail2ban-server cpuset=/ mems_allowed=0 [651271.847058] Pid: 4391, comm: fail2ban-server Not tainted 3.2.0-3-amd64 #1 [651271.847504] Call Trace: [651271.847815] [<ffffffff810b640a>] ? dump_header+0x78/0x1bd [651271.848242] [<ffffffff8116303e>] ? security_real_capable_noaudit+0x40/0x4f [651271.848709] [<ffffffff810b67d3>] ? oom_kill_process+0x49/0x271 [651271.849172] [<ffffffff810b6ece>] ? out_of_memory+0x2ea/0x337 [651271.849603] [<ffffffff810bac76>] ? __alloc_pages_nodemask+0x5e4/0x726 [651271.850056] [<ffffffff810e4094>] ? alloc_pages_current+0xb2/0xcc [651271.850484] [<ffffffff810b3c1e>] ? find_get_page+0x1a/0x63 [651271.850891] [<ffffffff810b5b11>] ? filemap_fault+0x24f/0x33e [651271.851307] [<ffffffff810cdc24>] ? __do_fault+0xc8/0x3ac [651271.851717] [<ffffffff810d019f>] ? handle_pte_fault+0x298/0x79f [651271.852150] [<ffffffff810cd779>] ? pte_offset_kernel+0x16/0x35 [651271.852587] [<ffffffff8134dc83>] ? do_page_fault+0x312/0x337 [651271.852989] [<ffffffff81065ad7>] ? timekeeping_get_ns+0xd/0x2a [651271.853408] [<ffffffff81065e62>] ? ktime_get_ts+0x5d/0x84 [651271.853838] [<ffffffff81036457>] ? should_resched+0x5/0x23 [651271.854244] [<ffffffff813499cf>] ? _cond_resched+0x7/0x1c [651271.854672] [<ffffffff8110701b>] ? poll_select_copy_remaining+0xda/0xf9 [651271.855111] [<ffffffff8134b275>] ? page_fault+0x25/0x30 [651271.855512] Mem-Info: [651271.855812] Node 0 DMA per-cpu: [651271.856165] CPU 0: hi: 0, btch: 1 usd: 0 [651271.856567] Node 0 DMA32 per-cpu: [651271.856907] CPU 0: hi: 186, btch: 31 usd: 46 [651271.857279] active_anon:85949 inactive_anon:85969 isolated_anon:0 [651271.857280] active_file:9 inactive_file:64 isolated_file:0 [651271.857281] unevictable:0 dirty:1 writeback:0 unstable:0 [651271.857282] free:10354 slab_reclaimable:1378 slab_unreclaimable:2698 [651271.857283] mapped:77 shmem:169 pagetables:2193 bounce:0 [651271.859344] Node 0 DMA free:3740kB min:776kB low:968kB high:1164kB active_anon:5864kB inactive_anon:5912kB active_file:0kB inactive_file:0kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15688kB mlocked:0kB dirty:0kB writeback:0kB mapped:0kB shmem:0kB slab_reclaimable:252kB slab_unreclaimable:88kB kernel_stack:0kB pagetables:56kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:516 all_unreclaimable? yes [651271.862040] lowmem_reserve[]: 0 741 741 741 [651271.862437] Node 0 DMA32 free:37676kB min:37716kB low:47144kB high:56572kB active_anon:337932kB inactive_anon:337964kB active_file:36kB inactive_file:256kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:759508kB mlocked:0kB dirty:4kB writeback:0kB mapped:308kB shmem:676kB slab_reclaimable:5260kB slab_unreclaimable:10704kB kernel_stack:2208kB pagetables:8716kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:199 all_unreclaimable? yes [651271.865153] lowmem_reserve[]: 0 0 0 0 [651271.865690] Node 0 DMA: 1*4kB 1*8kB 7*16kB 5*32kB 18*64kB 2*128kB 0*256kB 0*512kB 0*1024kB 1*2048kB 0*4096kB = 3740kB [651271.866606] Node 0 DMA32: 427*4kB 324*8kB 324*16kB 215*32kB 99*64kB 33*128kB 16*256kB 1*512kB 0*1024kB 1*2048kB 1*4096kB = 37676kB [651271.867547] 10093 total pagecache pages [651271.867900] 9840 pages in swap cache [651271.868262] Swap cache stats: add 878013, delete 868173, find 2062031/2100593 [651271.868945] Free swap = 0kB [651271.869257] Total swap = 1023996kB [651271.872476] 196589 pages RAM [651271.872811] 4843 pages reserved [651271.873136] 515 pages shared [651271.873463] 180640 pages non-shared [651271.873827] [ pid ] uid tgid total_vm rss cpu oom_adj oom_score_adj name [651271.874527] [ 296] 0 296 5424 0 0 -17 -1000 udevd [651271.875227] [ 2762] 0 2762 4739 35 0 0 0 rpcbind [651271.875943] [ 3494] 0 3494 4764 1 0 0 0 starter [651271.876674] [ 3501] 108 3501 98342 104 0 0 0 charon [651271.877356] [ 3593] 0 3593 13403 34 0 0 0 rsyslogd [651271.878076] [ 3630] 0 3630 980 0 0 0 0 acpi_fakekeyd [651271.878785] [ 3745] 0 3745 1058 1 0 0 0 acpid [651271.879483] [ 3795] 0 3795 4164 0 0 0 0 atd [651271.880175] [ 3826] 109 3825 406347 152200 0 0 0 named [651271.880863] [ 3914] 0 3914 22585 11 0 0 0 apache2 [651271.881548] [ 3917] 33 3917 22361 3 0 0 0 apache2 [651271.882236] [ 3941] 33 3941 78568 92 0 0 0 apache2 [651271.882939] [ 3942] 33 3942 78554 50 0 0 0 apache2 [651271.883631] [ 3943] 33 3943 78580 73 0 0 0 apache2 [651271.884345] [ 4042] 0 4042 5505 22 0 0 0 cron [651271.885049] [ 4073] 103 4073 7485 0 0 0 0 dbus-daemon [651271.885743] [ 4101] 0 4101 15505 0 0 0 0 console-kit-dae [651271.886491] [ 4180] 0 4180 14346 1 0 0 0 polkitd [651271.887179] [ 4209] 111 4209 2705 11 0 0 0 epmd [651271.887873] [ 4263] 111 4263 29833 2103 0 0 0 beam [651271.888578] [ 4367] 0 4367 17778 446 0 0 0 fail2ban-server [651271.889308] [ 4380] 0 4380 5572 13 0 0 0 gam_server [651271.890015] [ 4664] 0 4664 3268 0 0 0 0 mcelog [651271.890697] [ 4693] 104 4693 9745 25 0 0 0 ntpd [651271.891399] [ 4763] 114 4763 20941 77 0 -13 -900 postgres [651271.892140] [ 4838] 114 4838 20937 24 0 0 0 postgres [651271.892859] [ 4839] 114 4839 20937 16 0 0 0 postgres [651271.893573] [ 4840] 114 4840 21153 104 0 0 0 postgres [651271.894249] [ 4841] 114 4841 17381 50 0 0 0 postgres [651271.894950] [ 5390] 0 5390 9442 29 0 0 0 master [651271.895625] [ 5409] 105 5409 10000 38 0 0 0 qmgr [651271.896339] [ 5421] 112 5421 19209 3452 0 0 0 sks [651271.897025] [ 5422] 112 5422 11925 2558 0 0 0 sks [651271.897707] [ 5423] 0 5423 1021 0 0 0 0 startpar [651271.898403] [ 5442] 0 5442 12459 12 0 -17 -1000 sshd [651271.899094] [ 5595] 0 5595 4467 2 0 0 0 getty [651271.899806] [ 5596] 0 5596 4467 2 0 0 0 getty [651271.908612] [ 5597] 0 5597 4467 2 0 0 0 getty [651271.909301] [ 5598] 0 5598 4467 2 0 0 0 getty [651271.909984] [ 5599] 0 5599 4467 2 0 0 0 getty [651271.910681] [ 5600] 0 5600 4467 2 0 0 0 getty [651271.911349] [ 5972] 111 5972 2696 1 0 0 0 inet_gethost [651271.912090] [ 5973] 111 5973 4272 2 0 0 0 inet_gethost [651271.912804] [11641] 33 11641 78566 49 0 0 0 apache2 [651271.913509] [11732] 111 11732 4272 24 0 0 0 inet_gethost [651271.914215] [23714] 105 23714 10588 51 0 0 0 tlsmgr [651271.915043] [14407] 111 14407 4272 2 0 0 0 inet_gethost [651271.915751] [10917] 0 10917 1178 38 0 0 0 haveged [651271.916480] [12587] 105 12587 9959 134 0 0 0 pickup [651271.917181] Out of memory: Kill process 3826 (named) score 860 or sacrifice child [651271.917879] Killed process 3826 (named) total-vm:1625388kB, anon-rss:608796kB, file-rss:4kB Now I only serve a few very very small zones and when bind stard it needs only about 100M. The node is not one with super much memory (~750M) but it has 1G swap ... and both should be plenty enough to run a small nameserver. There's nothing specific in the bind log: ... 06-Oct-2012 00:14:30.392 resolver: notice: clients-per-query decreased to 34 06-Oct-2012 00:16:34.424 database: notice: acache 0x7fe1788c5550 stats: hits=357858 misses=14969 queries=372827 adds=14967 deleted=14984 cleaned=14984 cleaner_runs=203 overmem=0 overmem_nocreates=2 nomem=0 06-Oct-2012 00:16:34.426 database: notice: acache 0x7fe1788c5550 cleaning interval set to 3600. 06-Oct-2012 00:30:58.520 database: notice: acache 0x7fe1788c5550 stats: hits=375465 misses=14990 queries=390455 adds=14982 deleted=14989 cleaned=14989 cleaner_runs=204 overmem=0 overmem_nocreates=8 nomem=0 06-Oct-2012 00:30:58.524 database: notice: acache 0x7fe1788c5550 cleaning interval set to 3600. then it crashed and I restared at: 06-Oct-2012 01:24:03.773 general: info: zone ***.aq/IN: loaded serial 2010120401 Any ideas? Or advise how/what I could check further? Thanks, Chris. -- System Information: Debian Release: wheezy/sid APT prefers unstable APT policy: (500, 'unstable') Architecture: amd64 (x86_64) Kernel: Linux 3.5-trunk-amd64 (SMP w/8 CPU cores) Locale: LANG=en_DE.UTF-8, LC_CTYPE=en_DE.UTF-8 (charmap=UTF-8) Shell: /bin/sh linked to /bin/dash -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org