------- Comment From frede...@fr.ibm.com 2018-01-12 08:59 EDT-------

Hi Aneesh,

Right, I missed the 2 other ones. I rebuilt a kernel with the 3 patches. Thanks for having thought of this, because that fixes the issue.

Some details: this issue happens only when disabling RPT ("disable_radix"), which is not the default in Artful. Also, I could reproduce it on Witherspoon systems but not on Boston, which has another issue that may hide this one. FYI, with that other issue the kernel loops infinitely, displaying:

[ 0.000000] Allocated bitmap for 2040 MSIs (base IRQ 0x1fd000)
[ 0.000000] Initializing IODA2 PHB (/pciex@620c3c0300000)
[ 0.000000] PCI host bridge /pciex@620c3c0300000 ranges:
[ 0.0000[ 89.349426896,3] opalmsg: No available node in the free list, allocating
[ 89.352371632,3] opalmsg: No available node in the free list, allocating
00] M[ 89.355953552,3] opalmsg: No available node in the free list, allocating
[ 89.359518752,3] opalmsg: No available node in the free list, allocating
[ 89.363009568,3] opalmsg: No available node in the free list, allocating
[ 89.366608224,3] opalmsg: No available node in the free list, allocating
[ 89.370155632,3] opalmsg: No available node in the free list, allocating
[ 89.373689632,3] opalmsg: No available node in the free list, allocating
[ 89.376579088,3] opalmsg: No available node in the free list, allocating
[ 89.380134464,3] opalmsg: No available node in the free list, allocating
[ 89.383665744,3] opalmsg: No available node in the free list, allocating
[ 89.387155088,3] opalmsg: No available node in the free list, allocating
[ 89.390741040,3] opalmsg: No available node in the free list, allocating
[ 89.393600592,3] opalmsg: No available node in the free list, allocating
[ 89.397146720,3] opalmsg: No available node in the free list, allocating
[ 89.400687600,3] opalmsg: No available node in the free list, allocating
[ 89.404226032,3] opalmsg: No available node in the free list, allocating
[ 89.407772816,3] opalmsg: No available node in the free list, allocating
[ 89.410644096,3] opalmsg: No available node in the free list, allocating
....

So it seems those commits help:

-----------
commit 21a0e8c14bf61472723d2acc83f98ab35ff321b4
Author: Michael Ellerman <m...@ellerman.id.au>
Date: Tue Aug 1 20:29:24 2017 +1000

    powerpc/mm/hash64: Make vmalloc 56T on hash

commit b5048de04b32104140e5b251005404c3e0d03ccd
Author: Michael Ellerman <m...@ellerman.id.au>
Date: Tue Aug 1 20:29:23 2017 +1000

    powerpc/mm/slb: Move comment next to the code it's referring to

commit 63ee9b2ff9d306efaa61b04b8710fafe339ae441
Author: Michael Ellerman <m...@ellerman.id.au>
Date: Tue Aug 1 20:29:22 2017 +1000

    powerpc/mm/book3s64: Make KERN_IO_START a variable
-----------

F.

--
You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1739498

Title:
  Ubuntu 17.10 crashes on vmalloc.c

Status in The Ubuntu-power-systems project:
  Triaged
Status in linux package in Ubuntu:
  Triaged

Bug description:

== Comment: #0 - Breno Leitao - 2017-12-19 09:48:10 ==

When running Ubuntu 17.10 on POWER9, I got the following error:

[409038.118908] WARNING: CPU: 47 PID: 294 at /build/linux-LIHoWc/linux-4.13.0/mm/vmalloc.c:2527 pcpu_get_vm_areas+0x62c/0x660
[409038.118909] Modules linked in: xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack libcrc32c ipt_REJECT nf_reject_ipv4 xt_tcpudp bridge stp llc ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter kvm_hv kvm at24 ofpart ipmi_powernv ipmi_devintf ipmi_msghandler cmdlinepart powernv_flash uio_pdrv_genirq uio mtd vmx_crypto ibmpowernv crct10dif_vpmsum opal_prd binfmt_misc ip_tables x_tables autofs4 crc32c_vpmsum ast i2c_algo_bit drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops ttm drm tg3 ahci libahci
[409038.118933] CPU: 47 PID: 294 Comm: kworker/47:0 Tainted: G W 4.13.0-12-generic #13-Ubuntu
[409038.118934] Workqueue: events pcpu_balance_workfn
[409038.118936] task: c000003fe3cdcc00 task.stack: c000003fe3be0000
[409038.118937] NIP: c00000000032c1fc LR: c0000000002f5fd4 CTR: 0000000000000000
[409038.118937] REGS: c000003fe3be3810 TRAP: 0700 Tainted: G W (4.13.0-12-generic)
[409038.118938] MSR: 900000000282b033 <SF,HV,VEC,VSX,EE,FP,ME,IR,DR,RI,LE>
[409038.118944] CR: 24024828 XER: 20040000
[409038.118944] CFAR: c00000000032bdb8 SOFTE: 1
GPR00: 000020000df00000 c000003fe3be3a90 c0000000015e3000 c000203fff6b6880
GPR04: c000203fff223608 0000000000000008 c000203fff6b6888 0000000000000000
GPR08: 000020000df00000 0000080000000000 0000000001600000 c000203fff6b6888
GPR12: 0000000000000002 c00000000faded80 c000000000f6c050 c000003fe3be3bc0
GPR16: 0000000000100000 0000000000000000 c00000000189daf8 c000203fff223608
GPR20: 000020000f500000 c000203fff2235f8 c000203fff223600 0000000000000000
GPR24: 0000000000000000 c000203fff6b6888 0000000000000001 000020000f500000
GPR28: 0000000000000002 00000000000fffff d000080000000000 d000000000000000
[409038.118963] NIP [c00000000032c1fc] pcpu_get_vm_areas+0x62c/0x660
[409038.118964] LR [c0000000002f5fd4] pcpu_create_chunk+0xb4/0x1b0
[409038.118965] Call Trace:
[409038.118966] [c000003fe3be3a90] [c000003fe3be3ad0] 0xc000003fe3be3ad0 (unreliable)
[409038.118968] [c000003fe3be3b60] [c0000000002f5fd4] pcpu_create_chunk+0xb4/0x1b0
[409038.118970] [c000003fe3be3ba0] [c0000000002f7890] pcpu_balance_workfn+0x600/0x960
[409038.118972] [c000003fe3be3ca0] [c0000000001205d8] process_one_work+0x298/0x5a0
[409038.118975] [c000003fe3be3d30] [c000000000120968] worker_thread+0x88/0x620
[409038.118977] [c000003fe3be3dc0] [c00000000012980c] kthread+0x1ac/0x1c0
[409038.118979] [c000003fe3be3e30] [c00000000000b4e8] ret_from_kernel_thread+0x5c/0x74
[409038.118980] Instruction dump:
[409038.118981] eae30000 4191fad0 7ed3b378 e9210030 7efbbb78 7c791b78 3b400000 e9530000

---uname output---
4.13.0-12-generic

== Comment: #3 - ANEESH K. K V - 2017-12-20 05:59:13 ==

https://lkml.kernel.org/r/1501583364-14909-1-git-send-email-...@ellerman.id.au

The above may be related?

Related discussions:
https://lkml.kernel.org/r/20170724134240.gl25...@dhcp22.suse.cz

-aneesh

== Comment: #4 - Breno Leitao - 2017-12-20 09:48:07 ==

I just tested with kernel 4.15.0-041500rc4 and I didn't see a problem so far.