Apparently lundmark has ECC memory problems: lundmark login: [13906.806163] Synchronous External Abort: synchronous parity or ECC error (0x96000018) at 0x0000ffffa1a36000 [13906.819864] Internal error: : 96000018 [#2] SMP [13906.826338] Modules linked in: nls_iso8859_1 i2c_thunderx thunderx_edac i2c_smbus thunderx_zip gpio_keys shpchp cavium_rng_vf cavium_rng uio_pdrv_gen irq ipmi_ssif uio ipmi_devintf ipmi_msghandler ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi autofs4 btrfs ra id10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear nicvf nicpf ast i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sysimgblt aes_ce_blk fb_sys_fops aes_ce_cipher crc32_ce drm crct10dif_ce ghash_ce sha2_ce sha1_ce ahci libah ci thunder_bgx thunder_xcv mdio_thunder thunderx_mmc mdio_cavium aes_neon_bs aes_neon_blk crypto_simd cryptd [13906.907111] CPU: 0 PID: 43189 Comm: cc1 Tainted: G D 4.13.0-33-generic #36~kpti414backport [13906.921356] Hardware name: Cavium ThunderX CRB/To be filled by O.E.M., BIOS 5.11 12/12/2012 [13906.932232] task: ffff801ebfb65a00 task.stack: ffff000017938000 [13906.940682] PC is at __arch_copy_to_user+0x140/0x248 [13906.948211] LR is at cp_new_stat+0x188/0x198 [13906.954972] pc : [<ffff000008a82040>] lr : [<ffff0000082d10d8>] pstate: 80000145 [13906.964884] sp : ffff00001793bd40 [13906.970651] x29: ffff00001793bd40 x28: ffff801ebfb65a00 [13906.978453] x27: ffff000008ab2000 x26: 0000000000000050 [13906.986192] x25: 0000000000000124 x24: 0000000000000015 [13906.993908] x23: 0000000000000000 x22: 000000001793bd98 [13907.001571] x21: ffff801ebfb65a00 x20: ffff0000093d8000 [13907.009221] x19: ffff00001793be30 x18: 0000000000000000 [13907.016833] x17: 0000ffffa2987748 x16: ffff0000082d17a0 [13907.024426] x15: ffffffffffffffff x14: ff00000000000000 [13907.032072] x13: ffffffffffffffff x12: 0000000000000002 [13907.039698] x11: 0000000000000004 x10: 0000000000000000 [13907.047360] x9 : 000003e8000003e8 x8 : 00000001000081b4 [13907.054885] x7 : 00000000019a0839 x6 : 000000001793bdb0 [13907.062466] x5 : 000000001793be18 x4 : 0000000000000008 [13907.070147] x3 : 0000000000000802 x2 : fffffffffffffff8 [13907.077750] x1 : ffff00001793bda0 x0 : 000000001793bd98 [13907.085277] Process cc1 (pid: 43189, stack limit = 0xffff000017938000) [13907.094022] Stack: (0xffff00001793bd40 to 0xffff00001793c000) [22/7865] [13907.102004] bd40: ffff00001793be00 ffff0000082d17f8 ffff0000093d8000 0000000000000004 [13907.112119] bd60: 000000001793bd98 0000ffffa298775c ffff801ce388f710 0000000000000802 [13907.122311] bd80: 00000000019a0839 00000001000081b4 000003e8000003e8 0000000000000000 [13907.132302] bda0: 0000000000000000 000000000000075c 0000000000001000 0000000000000008 [13907.142201] bdc0: 000000005a857e32 000000001eb68da6 000000005a857a9a 000000002a0b1064 [13907.152053] bde0: 000000005a857a9a 000000002a0b1064 0000000000000000 0000000000040d00 [13907.161858] be00: ffff00001793bff0 ffff000008083c00 ffffffffffffff2c 0000801f7375f000 [13907.171601] be20: 00000000ffffffff 0000000017940200 000081b400000fff 0000100000000001 [13907.181425] be40: 0000000000000000 0000000000000874 00000000019a0839 0000000000800002 [13907.191582] be60: 000003e8000003e8 000000000000075c 000000005a857e32 000000001eb68da6 [13907.201202] be80: 000000005a857a9a 000000002a0b1064 000000005a857a9a 000000002a0b1064 [13907.210827] bea0: 000000005a857a9a 000000002a0b1064 0000000000000008 0000000000040d00 [13907.220314] bec0: 0000000000000004 000000001793bd98 000000001793bd98 0000000000000000 [13907.229797] bee0: 00000000178ddb40 0000000000000006 00000000000001fd 0000000000000008 [13907.239364] bf00: 0000000000000050 0000000000000004 0101010101010101 0000000000000004 [13907.248763] bf20: 0000000000000002 ffffffffffffffff ff00000000000000 ffffffffffffffff [13907.258136] bf40: 000000000118b008 0000ffffa2987748 0000000000000000 000000001793bd50 [13907.267557] bf60: 000000001793bd50 0000000017819fc0 0000000017940200 000000001791ff70 [13907.276998] bf80: 000000001789ef90 00000000505fd91e 0000000000000000 0000000000000001 [13907.286323] bfa0: 0000000000000000 0000ffffda518ce0 0000000000d9b3ac 0000ffffda518ce0 [13907.295739] bfc0: 0000ffffa298775c 0000000000000000 0000000000000004 0000000000000050 [13907.305097] bfe0: 0000000000000000 0000000000000000 0000000000000000 0000000000000000 [13907.314448] Call trace: [13907.318490] Exception stack(0xffff00001793bc00 to 0xffff00001793bd40) [13907.326649] bc00: 000000001793bd98 ffff00001793bda0 fffffffffffffff8 0000000000000802 [13907.336198] bc20: 0000000000000008 000000001793be18 000000001793bdb0 00000000019a0839 [13907.345688] bc40: 00000001000081b4 000003e8000003e8 0000000000000000 0000000000000004 [13907.355204] bc60: 0000000000000002 ffffffffffffffff ff00000000000000 ffffffffffffffff [13907.364845] bc80: ffff0000082d17a0 0000ffffa2987748 0000000000000000 ffff00001793be30 [13907.374415] bca0: ffff0000093d8000 ffff801ebfb65a00 000000001793bd98 0000000000000000 [13907.383982] bcc0: 0000000000000015 0000000000000124 0000000000000050 ffff000008ab2000 [13907.393622] bce0: ffff801ebfb65a00 ffff00001793bd40 ffff0000082d10d8 ffff00001793bd40 [13907.403244] bd00: ffff000008a82040 0000000080000145 ffff00001793bd30 ffff0000083930e4 [13907.412890] bd20: 0000ffffffffffff ffff0000082d1018 ffff00001793bd40 ffff000008a82040 [13907.422611] [<ffff000008a82040>] __arch_copy_to_user+0x140/0x248 [13907.430452] [<ffff0000082d17f8>] SyS_newfstat+0x58/0x88 [13907.437420] Exception stack(0xffff00001793bec0 to 0xffff00001793c000) [13907.445694] bec0: 0000000000000004 000000001793bd98 000000001793bd98 0000000000000000 [13907.455404] bee0: 00000000178ddb40 0000000000000006 00000000000001fd 0000000000000008 [13907.464995] bf00: 0000000000000050 0000000000000004 0101010101010101 0000000000000004 [13907.474690] bf20: 0000000000000002 ffffffffffffffff ff00000000000000 ffffffffffffffff [13907.484298] bf40: 000000000118b008 0000ffffa2987748 0000000000000000 000000001793bd50 [13907.494060] bf60: 000000001793bd50 0000000017819fc0 0000000017940200 000000001791ff70 [13907.503781] bf80: 000000001789ef90 00000000505fd91e 0000000000000000 0000000000000001 [13907.513428] bfa0: 0000000000000000 0000ffffda518ce0 0000000000d9b3ac 0000ffffda518ce0 [13907.523099] bfc0: 0000ffffa298775c 0000000000000000 0000000000000004 0000000000000050 [13907.532754] bfe0: 0000000000000000 0000000000000000 0000000000000000 0000000000000000 [13907.542408] [<ffff000008083c00>] el0_svc_naked+0x34/0x38 [13907.549542] Code: a88120c7 d503201f d503201f a8c12829 (a8c1302b) [13907.557472] ---[ end trace d53e2d07571b0a7e ]---
-- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1749685 Title: Kernel panic on ThunderX Status in linux package in Ubuntu: Incomplete Bug description: While doing testing on lundmark, i observed (from time to time) panics on 4.13.0-32.35~16.04.1-generic - i got this one while deploying the board: Booting under MAAS direction... [ grub.cfg-40:8d:5c:ba 606B 100% 1.56KiB/s ] EFI stub: Booting Linux Kernel... [ boot-initrd 46.78MiB 100% 6.57MiB/s ] EFI stub: EFI_RNG_PROTOCOL unavailable, no randomness supplied EFI stub: Using DTB from configuration table EFI stub: Exiting boot services and installing virtual address map... [ 0.000000] Booting Linux on physical CPU 0x0 [ 0.000000] random: get_random_bytes called from start_kernel+0x50/0x460 with crng_init=0 [ 0.000000] Linux version 4.13.0-32-generic (buildd@bos01-arm64-018) (gcc version 5.4.0 20160609 (Ubuntu/Linaro 5.4.0-6ubuntu1~16.04.5)) #35~16.04.1- Ubuntu SMP Thu Jan 25 10:10:26 UTC 2018 (Ubuntu 4.13.0-32.35~16.04.1-generic 4.13.13) [ 0.000000] Boot CPU: AArch64 Processor [431f0a11] [ 0.000000] Machine model: cavium,thunder-88xx [ 0.000000] efi: Getting EFI parameters from FDT: [ 0.000000] efi: EFI v2.40 by American Megatrends [ 0.000000] efi: ESRT=0x1ffce5ac18 SMBIOS 3.0=0x1ffce5a918 ACPI 2.0=0x1ffeb46000 [ 0.000000] esrt: Reserving ESRT space from 0x0000001ffce5ac18 to 0x0000001ffce5ac50. [ 0.000000] NUMA: NODE_DATA [mem 0x1fff0c4d00-0x1fff0c7fff] [ 0.000000] Zone ranges: [ 0.000000] DMA [mem 0x0000000000500000-0x00000000ffffffff] [ 0.000000] Normal [mem 0x0000000100000000-0x0000001fff0fffff] [ 0.000000] Movable zone start for each node [ 0.000000] Early memory node ranges ... [ 0.000000] Kernel command line: BOOT_IMAGE=ubuntu/arm64/hwe-16.04/xenial/daily/boot-kernel nomodeset root=squash:http://10.229.32.21:5248/images/ubu ntu/arm64/hwe-16.04/xenial/daily/squashfs ro ip=::::lundmark:BOOTIF ip6=off overlayroot=tmpfs overlayroot_cfgdisk=disabled cc:{datasource_list: [MAAS]}e nd_cc cloud-config-url=http://10.229.32.21:5240/MAAS/metadata/latest/by-id/ttctk4/?op=get_preseed apparmor=0 log_host=10.229.32.21 log_port=514 BOOTIF=0 1-40:8d:5c:ba:cd:d4 ... [ 9.058541] Synchronous External Abort: synchronous parity or ECC error (0x86000018) at 0x0000ffff9658fc9c [ 9.058545] Internal error: : 86000018 [#1] SMP [ 9.058548] Modules linked in: ast(+) i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect aes_ce_blk sysimgblt aes_ce_cipher fb_sys_fops crc32_ce crct10dif_ce drm ghash_ce sha2_ce sha1_ce ahci libahci thunder_bgx(+) i2c_thunderx(+) thunder_xcv i2c_smbus ipmi_ssif mdio_thunder thunderx_mmc mdio_ca vium ipmi_devintf ipmi_msghandler aes_neon_bs aes_neon_blk crypto_simd cryptd [ 9.058588] CPU: 7 PID: 0 Comm: swapper/7 Not tainted 4.13.0-32-generic #35~16.04.1-Ubuntu [ 9.058589] Hardware name: Cavium ThunderX CRB/To be filled by O.E.M., BIOS 5.11 12/12/2012 [ 9.058591] task: ffff801f700c6900 task.stack: ffff801f700cc000 [ 9.058600] PC is at __remove_hrtimer+0x48/0xa8 [ 9.058602] LR is at __remove_hrtimer+0x5c/0xa8 [ 9.058604] pc : [<ffff000008153d88>] lr : [<ffff000008153d9c>] pstate: 004001c5 [ 9.058606] sp : ffff801f79787e60 [ 9.058607] x29: ffff801f79787e60 x28: ffff801f700c6900 [ 9.058611] x27: 000000021bc7a0ca x26: ffff000008fcd000 [ 9.058614] x25: 0000000000000001 x24: ffff000008fcd000 [ 9.058617] x23: ffff0000093b9658 x22: ffff801f7978f598 [ 9.058620] x21: ffff801f7978f5c0 x20: ffff801f7978f580 [ 9.058624] x19: ffff801f7978fa00 x18: 0000ffffc8e8cb78 [ 9.058627] x17: 000000000000668a x16: 0000000000000000 [ 9.058630] x15: 0000ffff968adcc0 x14: 343030302c333030 [ 9.058633] x13: 302c323030302c31 x12: 3030302c30303030 [ 9.058636] x11: 0000aaaac635fa10 x10: 0000000000000b00 [ 9.058639] x9 : 0000000000000040 x8 : ffff801f780026f0 [ 9.058643] x7 : 0000000000000000 x6 : ffff801f7978fa00 [ 9.058646] x5 : 0000000000000000 x4 : ffff801f7978fb58 [ 9.058649] x3 : ffff801f7978fb58 x2 : ffff801f7978fb58 [ 9.058652] x1 : ffff801f7978f5d0 x0 : 0000000000000001 [ 9.058656] Process swapper/7 (pid: 0, stack limit = 0xffff801f700cc000) [ 9.058658] Stack: (0xffff801f79787e60 to 0xffff801f700d0000) [ 9.058660] Call trace: [ 9.058662] Exception stack(0xffff801f79787c70 to 0xffff801f79787da0) [ 9.058665] 7c60: ffff801f7978fa00 0001000000000000 [ 9.058668] 7c80: 000000000242d000 ffff000008153d88 00000000004001c5 ffff801f79787d38 [ 9.058671] 7ca0: ffff801f79787cd0 ffff0000081070a4 ffff801f79787cd0 ffff0000081070b4 [ 9.058674] 7cc0: ffff801f6f1b9e00 ffff0000093b8c08 ffff801f79787d50 ffff000008107318 [ 9.058677] 7ce0: ffff801f6f1b9e00 ffff801f79791c10 ffff801f79795020 0000000000000000 [ 9.058680] 7d00: 0000000000000100 0000000000000007 ffff000009555658 ffff0000093b8000 [ 9.058682] 7d20: ffff801f79787d50 0000000000040d00 0000000000000001 ffff801f7978f5d0 [ 9.058685] 7d40: ffff801f7978fb58 ffff801f7978fb58 ffff801f7978fb58 0000000000000000 [ 9.058688] 7d60: ffff801f7978fa00 0000000000000000 ffff801f780026f0 0000000000000040 [ 9.058691] 7d80: 0000000000000b00 0000aaaac635fa10 3030302c30303030 302c323030302c31 [ 9.058695] [<ffff000008153d88>] __remove_hrtimer+0x48/0xa8 [ 9.058697] [<ffff000008153f74>] __hrtimer_run_queues+0xbc/0x2a8 [ 9.058700] [<ffff000008154b08>] hrtimer_interrupt+0xa8/0x228 [ 9.058707] [<ffff0000088cde24>] arch_timer_handler_phys+0x3c/0x50 [ 9.058711] [<ffff00000813dbc4>] handle_percpu_devid_irq+0x8c/0x230 [ 9.058714] [<ffff000008137914>] generic_handle_irq+0x34/0x50 [ 9.058716] [<ffff000008138018>] __handle_domain_irq+0x68/0xc0 [ 9.058719] [<ffff0000080816ec>] gic_handle_irq+0xcc/0x188 [ 9.058721] Exception stack(0xffff801f700cfe00 to 0xffff801f700cff30) [ 9.058724] fe00: ffff000008fcd000 0000000000000000 0000000000000000 ffff000008fd6000 [ 9.058727] fe20: 0000801f707b7000 ffff801f700cff20 0000801f707b7000 ffff0000093b8698 [ 9.058729] fe40: 0000000000000000 ffff801f700cfe90 0000000000000b00 0000aaaac635fa10 [ 9.058732] fe60: 3030302c30303030 302c323030302c31 343030302c333030 0000ffff968adcc0 [ 9.058735] fe80: 0000000000000000 0000000000006688 0000ffffc8e8cb78 ffff000008fcd000 [ 9.058738] fea0: ffff0000093b9658 ffff0000093b9000 ffff000008fdd348 0000000000000000 [ 9.058741] fec0: 0000000000000000 ffff801f700c6900 0000000000000000 0000000000000000 [ 9.058743] fee0: 0000000000000000 ffff801f700cff30 ffff0000080859bc ffff801f700cff30 [ 9.058746] ff00: ffff0000080859c0 0000000000400145 ffff801f700cff20 ffff0000081489d8 [ 9.058748] ff20: ffffffffffffffff ffff000008148a54 [ 9.058751] [<ffff00000808315c>] el1_irq+0xdc/0x180 [ 9.058754] [<ffff0000080859c0>] arch_cpu_idle+0x30/0x168 [ 9.058760] [<ffff000008122bc4>] do_idle+0x114/0x1e0 [ 9.058763] [<ffff000008122e64>] cpu_startup_entry+0x2c/0x30 [ 9.058767] [<ffff000008092308>] secondary_start_kernel+0x108/0x118 [ 9.058770] [<00000000018781c4>] 0x18781c4 [ 9.058773] Code: 370000c0 a94153f3 a9425bf5 f9401bf7 (a8c47bfd) [ 9.058803] ---[ end trace 963acec48f21d263 ]--- [ 9.058805] Kernel panic - not syncing: Fatal exception in interrupt [ 9.058824] SMP: stopping secondary CPUs [ 9.059063] Kernel Offset: disabled [ 9.059066] CPU features: 0x101108 [ 9.059067] Memory Limit: none [ 9.656325] ---[ end Kernel panic - not syncing: Fatal exception in interrupt To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1749685/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp