Is there a tutorial available on how to bisect the kernel and install and test the bisected version?
To me it seems the problem only occurs in connection with NFS. I could not reproduce the problem when NFS is not used. However, after reading data from NFS mounts the problem reliably occurs within a few minutes. Here's a log from a third machine: [ 359.319126] BUG: unable to handle page fault for address: ffff9d95f6147697 [ 359.319132] #PF: supervisor read access in kernel mode [ 359.319134] #PF: error_code(0x0000) - not-present page [ 359.319136] PGD 378401067 P4D 378401067 PUD 0 [ 359.319141] Oops: 0000 [#1] SMP PTI [ 359.319146] CPU: 2 PID: 321 Comm: jbd2/sda6-8 Tainted: G OE 5.4.0-40-generic #44-Ubuntu [ 359.319148] Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./B85M Pro4, BIOS P2.50 12/11/2015 [ 359.319154] RIP: 0010:kmem_cache_alloc+0x7e/0x230 [ 359.319158] Code: 99 01 00 00 4d 8b 07 65 49 8b 50 08 65 4c 03 05 40 9d 56 4a 4d 8b 20 4d 85 e4 0f 84 85 01 00 00 41 8b 47 20 49 8b 3f 4c 01 e0 <48> 8b 18 48 89 c1 49 33 9f 70 01 00 00 4c 89 e0 48 0f c9 48 31 cb [ 359.319161] RSP: 0018:ffffa8984060b7c8 EFLAGS: 00010282 [ 359.319163] RAX: ffff9d95f6147697 RBX: 0000000000000000 RCX: 0000000000000001 [ 359.319165] RDX: 0000000000000019 RSI: 0000000000092a20 RDI: 0000000000031f90 [ 359.319167] RBP: ffffa8984060b7f8 R08: ffff9d954dab1f90 R09: ffff9d953f8132b8 [ 359.319169] R10: ffff9d953f8276c8 R11: 0000000000000029 R12: ffff9d95f6147697 [ 359.319171] R13: 0000000000092a20 R14: ffff9d954ada4380 R15: ffff9d954ada4380 [ 359.319174] FS: 0000000000000000(0000) GS:ffff9d954da80000(0000) knlGS:0000000000000000 [ 359.319176] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 359.319178] CR2: ffff9d95f6147697 CR3: 0000000377a0a002 CR4: 00000000001606e0 [ 359.319180] Call Trace: [ 359.319187] ? mempool_alloc_slab+0x17/0x20 [ 359.319192] mempool_alloc_slab+0x17/0x20 [ 359.319196] mempool_alloc+0x64/0x180 [ 359.319201] ? __enqueue_entity+0x96/0xa0 [ 359.319206] sg_pool_alloc+0x4f/0x60 [ 359.319211] __sg_alloc_table+0x10b/0x170 [ 359.319214] sg_alloc_table_chained+0x47/0xa0 [ 359.319217] ? mac_pton+0xb0/0xb0 [ 359.319222] scsi_init_io+0x52/0x180 [ 359.319227] sd_setup_read_write_cmnd+0x67/0x710 [ 359.319231] sd_init_command+0x11a/0x472 [ 359.319235] scsi_queue_rq+0x32e/0xa00 [ 359.319239] blk_mq_dispatch_rq_list+0x96/0x5a0 [ 359.319243] ? deadline_remove_request+0x4e/0xb0 [ 359.319246] ? dd_dispatch_request+0x1/0x1f0 [ 359.319250] blk_mq_do_dispatch_sched+0x67/0x100 [ 359.319254] blk_mq_sched_dispatch_requests+0x12d/0x180 [ 359.319259] __blk_mq_run_hw_queue+0x5a/0x110 [ 359.319263] __blk_mq_delay_run_hw_queue+0x15b/0x160 [ 359.319267] blk_mq_run_hw_queue+0x92/0x120 [ 359.319270] blk_mq_sched_insert_requests+0x74/0x100 [ 359.319273] blk_mq_flush_plug_list+0x1e8/0x290 [ 359.319277] blk_flush_plug_list+0xe3/0x110 [ 359.319280] blk_finish_plug+0x26/0x34 [ 359.319287] jbd2_journal_commit_transaction+0xda5/0x17e8 [ 359.319293] kjournald2+0xb6/0x280 [ 359.319297] ? wait_woken+0x80/0x80 [ 359.319301] kthread+0x104/0x140 [ 359.319304] ? commit_timeout+0x20/0x20 [ 359.319307] ? kthread_park+0x90/0x90 [ 359.319312] ret_from_fork+0x35/0x40 [ 359.319315] Modules linked in: rpcsec_gss_krb5 auth_rpcgss nfsv4 nfs lockd grace fscache rfcomm msr vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) amdgpu amd_iommu_v2 gpu_sched aufs hid_logitech_hidpp cmac algif_hash joydev algif_skcipher af_alg overlay bnep input_leds hid_logitech_dj hid_generic btusb uas btrtl intel_rapl_msr btbcm btintel usbhid bluetooth hid usb_storage ecdh_generic ecc intel_rapl_common x86_pkg_temp_thermal nls_iso8859_1 intel_powerclamp mei_hdcp snd_hda_codec_realtek snd_hda_codec_generic ledtrig_audio snd_hda_codec_hdmi snd_hda_intel snd_intel_dspcfg coretemp snd_hda_codec snd_hda_core kvm_intel snd_hwdep snd_pcm kvm snd_seq_midi radeon snd_seq_midi_event snd_rawmidi snd_seq crct10dif_pclmul ghash_clmulni_intel ttm snd_seq_device snd_timer aesni_intel drm_kms_helper crypto_simd snd cryptd i2c_algo_bit glue_helper fb_sys_fops syscopyarea mei_me intel_cstate sysfillrect mei sysimgblt intel_rapl_perf soundcore mac_hid nf_log_ipv6 ip6t_REJECT nf_reject_ipv6 xt_hl ip6t_rt [ 359.319364] nf_log_ipv4 nf_log_common ipt_REJECT nf_reject_ipv4 xt_LOG xt_limit xt_addrtype xt_tcpudp sch_fq_codel xt_conntrack nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c parport_pc ip6table_filter ip6_tables ppdev iptable_filter bpfilter lp parport drm sunrpc ip_tables x_tables autofs4 crc32_pclmul e1000e ahci i2c_i801 lpc_ich libahci video [ 359.319386] CR2: ffff9d95f6147697 [ 359.319389] ---[ end trace b059e600d0f4c10e ]--- -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1886277 Title: unable to handle page fault in mempool_alloc_slab Status in linux package in Ubuntu: Incomplete Bug description: On kernel 5.4.0-40-generic in focal I'm getting errors like this on several machines with different hardware in the first hour after boot: Jul 04 16:58:32 hostname kernel: BUG: unable to handle page fault for address: ffff9083e222e632 Jul 04 16:58:32 hostname kernel: #PF: supervisor read access in kernel mode Jul 04 16:58:32 hostname kernel: #PF: error_code(0x0000) - not-present page Jul 04 16:58:32 hostname kernel: PGD 3ac205067 P4D 3ac205067 PUD 0 Jul 04 16:58:32 hostname kernel: Oops: 0000 [#1] SMP NOPTI Jul 04 16:58:32 hostname kernel: CPU: 4 PID: 289 Comm: kworker/u16:4 Tainted: G OE 5.4.0-40-generic #44-Ubuntu Jul 04 16:58:32 hostname kernel: Hardware name: LENOVO 20N2CTO1WW/20N2CTO1WW, BIOS N2IET88W (1.66 ) 04/22/2020 Jul 04 16:58:32 hostname kernel: Workqueue: rpciod rpc_async_schedule [sunrpc] Jul 04 16:58:32 hostname kernel: RIP: 0010:kmem_cache_alloc+0x7e/0x230 Jul 04 16:58:32 hostname kernel: Code: 99 01 00 00 4d 8b 07 65 49 8b 50 08 65 4c 03 05 40 9d 56 44 4d 8b 20 4d 85 e4 0f 84 85 01 00 00 41 8b 47 20 49 8b 3f 4c 01 e0 <48> 8b 18 48 89 c1 49 33 9f 70 01 00 00 4c 89 e0 48 0f c9 48 31 cb Jul 04 16:58:32 hostname kernel: RSP: 0018:ffffbc38c046fcc8 EFLAGS: 00010282 Jul 04 16:58:32 hostname kernel: RAX: ffff9083e222e632 RBX: 0000000000000000 RCX: 0000000000000002 Jul 04 16:58:32 hostname kernel: RDX: 0000000000000009 RSI: 0000000000092800 RDI: 0000000000031fb0 Jul 04 16:58:32 hostname kernel: RBP: ffffbc38c046fcf8 R08: ffff90836c331fb0 R09: ffffffffc1436a94 Jul 04 16:58:32 hostname kernel: R10: ffff908368178d2c R11: 0000000000000018 R12: ffff9083e222e632 Jul 04 16:58:32 hostname kernel: R13: 0000000000092800 R14: ffff908367ca6140 R15: ffff908367ca6140 Jul 04 16:58:32 hostname kernel: FS: 0000000000000000(0000) GS:ffff90836c300000(0000) knlGS:0000000000000000 Jul 04 16:58:32 hostname kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Jul 04 16:58:32 hostname kernel: CR2: ffff9083e222e632 CR3: 00000003ab80a003 CR4: 00000000003606e0 Jul 04 16:58:32 hostname kernel: Call Trace: Jul 04 16:58:32 hostname kernel: ? mempool_alloc_slab+0x17/0x20 Jul 04 16:58:32 hostname kernel: mempool_alloc_slab+0x17/0x20 Jul 04 16:58:32 hostname kernel: mempool_alloc+0x64/0x180 Jul 04 16:58:32 hostname kernel: rpc_malloc+0xa1/0xb0 [sunrpc] Jul 04 16:58:32 hostname kernel: call_allocate+0xd1/0x1b0 [sunrpc] Jul 04 16:58:32 hostname kernel: ? call_refreshresult+0x100/0x100 [sunrpc] Jul 04 16:58:32 hostname kernel: __rpc_execute+0x8c/0x3a0 [sunrpc] Jul 04 16:58:32 hostname kernel: rpc_async_schedule+0x30/0x50 [sunrpc] Jul 04 16:58:32 hostname kernel: process_one_work+0x1eb/0x3b0 Jul 04 16:58:32 hostname kernel: worker_thread+0x4d/0x400 Jul 04 16:58:32 hostname kernel: kthread+0x104/0x140 Jul 04 16:58:32 hostname kernel: ? process_one_work+0x3b0/0x3b0 Jul 04 16:58:32 hostname kernel: ? kthread_park+0x90/0x90 Jul 04 16:58:32 hostname kernel: ret_from_fork+0x35/0x40 Jul 04 16:58:32 hostname kernel: Modules linked in: rfcomm rpcsec_gss_krb5 auth_rpcgss nfsv4 nfs lockd grace fscache vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) msr ccm cmac algif_hash algif_skcipher af_alg aufs bnep overlay nls_iso8859_1 mei_hdcp intel_rapl_msr snd_s> Jul 04 16:58:32 hostname kernel: nvram ledtrig_audio mei_me cfg80211 mei processor_thermal_device snd_seq ucsi_acpi typec_ucsi intel_rapl_common intel_soc_dts_iosf snd_seq_device typec intel_pch_thermal snd_timer snd int3403_thermal soundcore int340x_thermal_zone i> Jul 04 16:58:32 hostname kernel: pinctrl_cannonlake video pinctrl_intel Jul 04 16:58:32 hostname kernel: CR2: ffff9083e222e632 Jul 04 16:58:32 hostname kernel: ---[ end trace cbbaed921eb439ce ]--- Jul 04 16:58:32 hostname kernel: RIP: 0010:kmem_cache_alloc+0x7e/0x230 Jul 04 16:58:32 hostname kernel: Code: 99 01 00 00 4d 8b 07 65 49 8b 50 08 65 4c 03 05 40 9d 56 44 4d 8b 20 4d 85 e4 0f 84 85 01 00 00 41 8b 47 20 49 8b 3f 4c 01 e0 <48> 8b 18 48 89 c1 49 33 9f 70 01 00 00 4c 89 e0 48 0f c9 48 31 cb Jul 04 16:58:32 hostname kernel: RSP: 0018:ffffbc38c046fcc8 EFLAGS: 00010282 Jul 04 16:58:32 hostname kernel: RAX: ffff9083e222e632 RBX: 0000000000000000 RCX: 0000000000000002 Jul 04 16:58:32 hostname kernel: RDX: 0000000000000009 RSI: 0000000000092800 RDI: 0000000000031fb0 Jul 04 16:58:32 hostname kernel: RBP: ffffbc38c046fcf8 R08: ffff90836c331fb0 R09: ffffffffc1436a94 Jul 04 16:58:32 hostname kernel: R10: ffff908368178d2c R11: 0000000000000018 R12: ffff9083e222e632 Jul 04 16:58:32 hostname kernel: R13: 0000000000092800 R14: ffff908367ca6140 R15: ffff908367ca6140 Jul 04 16:58:32 hostname kernel: FS: 0000000000000000(0000) GS:ffff90836c300000(0000) knlGS:0000000000000000 Jul 04 16:58:32 hostname kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Jul 04 16:58:32 hostname kernel: CR2: ffff9083e222e632 CR3: 00000003ab80a003 CR4: 00000000003606e0 When booting 5.4.0-39-generic the problem does not occur. --- ProblemType: Bug ApportVersion: 2.20.11-0ubuntu27.3 Architecture: amd64 AudioDevicesInUse: USER PID ACCESS COMMAND /dev/snd/controlC0: lsysadmin 2042 F.... pulseaudio CasperMD5CheckResult: skip DistroRelease: Ubuntu 20.04 HibernationDevice: RESUME=UUID=9d3714bb-8799-42f9-a51d-790f87b0a7fc MachineType: LENOVO 20N2CTO1WW Package: linux (not installed) ProcFB: 0 i915drmfb ProcKernelCmdLine: BOOT_IMAGE=/vmlinuz-5.4.0-40-generic root=/dev/mapper/vgmagiko-root ro quiet splash vt.handoff=7 ProcVersionSignature: Ubuntu 5.4.0-40.44-generic 5.4.44 PulseList: Error: command ['pacmd', 'list'] failed with exit code 1: No PulseAudio daemon running, or not running as session daemon. RelatedPackageVersions: linux-restricted-modules-5.4.0-40-generic N/A linux-backports-modules-5.4.0-40-generic N/A linux-firmware 1.187.1 Tags: focal Uname: Linux 5.4.0-40-generic x86_64 UpgradeStatus: No upgrade log present (probably fresh install) UserGroups: N/A _MarkForUpload: True dmi.bios.date: 04/22/2020 dmi.bios.vendor: LENOVO dmi.bios.version: N2IET88W (1.66 ) dmi.board.asset.tag: Not Available dmi.board.name: 20N2CTO1WW dmi.board.vendor: LENOVO dmi.board.version: SDK0J40709 WIN dmi.chassis.asset.tag: No Asset Information dmi.chassis.type: 10 dmi.chassis.vendor: LENOVO dmi.chassis.version: None dmi.modalias: dmi:bvnLENOVO:bvrN2IET88W(1.66):bd04/22/2020:svnLENOVO:pn20N2CTO1WW:pvrThinkPadT490:rvnLENOVO:rn20N2CTO1WW:rvrSDK0J40709WIN:cvnLENOVO:ct10:cvrNone: dmi.product.family: ThinkPad T490 dmi.product.name: 20N2CTO1WW dmi.product.sku: LENOVO_MT_20N2_BU_Think_FM_ThinkPad T490 dmi.product.version: ThinkPad T490 dmi.sys.vendor: LENOVO To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1886277/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp