Package: src:linux
X-Debbugs-Cc: sb56...@gmail.com
Version: 6.1.94-1
Severity: important

Hi again, apologies for my slow response.

This is a follow-up to https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=1076034 which got archived. Would it be possible to re-open it?

This bug continues to affect me on this hardware with all of the kernel 6.1.x releases in Debian Stable. I also finally managed to test the latest 6.12 kernel as requested, and unfortunately the same issue happened to me twice today. (On my hardware there is another major unrelated bug with kernel versions >6.12.1, so I tested 6.12.1 from Xanmod because they have a repository of previous versions.)

The first time `journalctl` only showed this at the time of the lockup:
----------------------------------------
Dec 24 10:43:07 IntelNUC9 kernel: i915 0000:00:02.0: [drm] *ERROR* media: timed out waiting for forcewake ack request.
Dec 24 10:43:07 IntelNUC9 kernel: i915 0000:00:02.0: [drm] CI tainted: 0x9 by fw_domains_get_with_fallback+0x1ff/0x280 [i915]
----------------------------------------

But the second time there was a complete trace:
----------------------------------------
Dec 24 11:52:23 IntelNUC9 kernel: i915 0000:00:02.0: [drm] *ERROR* media: timed out waiting for forcewake ack request.
Dec 24 11:52:23 IntelNUC9 kernel: i915 0000:00:02.0: [drm] CI tainted: 0x9 by fw_domains_get_with_fallback+0x1ff/0x280 [i915]
Dec 24 11:52:23 IntelNUC9 kernel: i915 0000:00:02.0: [drm] Resetting rcs0 for preemption time out
Dec 24 11:52:23 IntelNUC9 kernel: i915 0000:00:02.0: [drm] *ERROR* GT0: rcs0 reset request timed out: {request: 00000001, RESET_CTL: 00000001}
Dec 24 11:52:23 IntelNUC9 kernel: i915 0000:00:02.0: [drm] GPU HANG: ecode 9:1:eedfffff, in zoom [4127]
Dec 24 11:52:33 IntelNUC9 kernel: i915 0000:00:02.0: [drm] *ERROR* media: timed out waiting for forcewake ack request.
Dec 24 11:52:33 IntelNUC9 kernel: i915 0000:00:02.0: [drm] CI tainted: 0x9 by fw_domains_get_with_fallback+0x1ff/0x280 [i915]
Dec 24 11:52:36 IntelNUC9 kernel: i915 0000:00:02.0: [drm] *ERROR* media: timed out waiting for forcewake ack request.
Dec 24 11:52:36 IntelNUC9 kernel: i915 0000:00:02.0: [drm] CI tainted: 0x9 by fw_domains_get_with_fallback+0x1ff/0x280 [i915]
Dec 24 11:52:37 IntelNUC9 kernel: BUG: kernel NULL pointer dereference, address: 0000000000000280
Dec 24 11:52:37 IntelNUC9 kernel: #PF: supervisor read access in kernel mode
Dec 24 11:52:37 IntelNUC9 kernel: #PF: error_code(0x0000) - not-present page
Dec 24 11:52:37 IntelNUC9 kernel: PGD 0 P4D 0
Dec 24 11:52:37 IntelNUC9 kernel: Oops: Oops: 0000 [#1] PREEMPT SMP PTI
Dec 24 11:52:37 IntelNUC9 kernel: CPU: 1 UID: 0 PID: 234 Comm: kworker/1:1H Tainted: G W OE 6.12.1-x64v3-xanmod1 #0~20241122.ge695ae7
Dec 24 11:52:37 IntelNUC9 kernel: Tainted: [W]=WARN, [O]=OOT_MODULE, [E]=UNSIGNED_MODULE
Dec 24 11:52:37 IntelNUC9 kernel: Hardware name: Intel(R) Client Systems LAPQC71A/LAPQC71A, BIOS QCCFL357.0144.2022.0124.1433 01/24/2022
Dec 24 11:52:37 IntelNUC9 kernel: Workqueue: events_highpri heartbeat [i915]
Dec 24 11:52:37 IntelNUC9 kernel: RIP: 0010:__i915_gpu_coredump+0x20b/0x7c0 [i915]
Dec 24 11:52:37 IntelNUC9 kernel: Code: 2b 46 28 89 44 24 08 e8 83 49 8a ea 8b 44 24 08 48 8b 74 24 28 85 c0 79 3a 48 8b 44 24 20 4c 8b 46 28 48 8d 55 18 48 8b 4e 20 <44> 0f b7 88 80 02 00 00 48 8b 45 08 48 8b 38 48 85 ff 74 04 48 8b
Dec 24 11:52:37 IntelNUC9 kernel: RSP: 0018:ffffabaf80bcfc80 EFLAGS: 00010286
Dec 24 11:52:37 IntelNUC9 kernel: RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000001058
Dec 24 11:52:37 IntelNUC9 kernel: RDX: ffff9a1a698ea018 RSI: ffff9a1c830d4b00 RDI: ffff9a1a507cb600
Dec 24 11:52:37 IntelNUC9 kernel: RBP: ffff9a1a698ea000 R08: 0000000000052854 R09: 00000000ffffffff
Dec 24 11:52:37 IntelNUC9 kernel: R10: 0000000000000000 R11: 000000000000e164 R12: 0000000000000000
Dec 24 11:52:37 IntelNUC9 kernel: R13: ffff9a1a61360000 R14: ffff9a1a5ea8a000 R15: ffff9a1a86187400
Dec 24 11:52:37 IntelNUC9 kernel: FS: 0000000000000000(0000) GS:ffff9a21dd880000(0000) knlGS:0000000000000000
Dec 24 11:52:37 IntelNUC9 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Dec 24 11:52:37 IntelNUC9 kernel: CR2: 0000000000000280 CR3: 000000059e834005 CR4: 00000000003726f0
Dec 24 11:52:37 IntelNUC9 kernel: Call Trace:
Dec 24 11:52:37 IntelNUC9 kernel: <TASK>
Dec 24 11:52:37 IntelNUC9 kernel: ? __die+0x1f/0x60
Dec 24 11:52:37 IntelNUC9 kernel: ? page_fault_oops+0x14c/0x540
Dec 24 11:52:37 IntelNUC9 kernel: ? intel_gt_mcr_lock+0x33/0x130 [i915]
Dec 24 11:52:37 IntelNUC9 kernel: ? intel_gt_mcr_unlock+0x15/0x60 [i915]
Dec 24 11:52:37 IntelNUC9 kernel: ? exc_page_fault+0x7d/0x190
Dec 24 11:52:37 IntelNUC9 kernel: ? asm_exc_page_fault+0x22/0x30
Dec 24 11:52:37 IntelNUC9 kernel: ? __i915_gpu_coredump+0x20b/0x7c0 [i915]
Dec 24 11:52:37 IntelNUC9 kernel: i915_capture_error_state+0x5d/0xb0 [i915]
Dec 24 11:52:37 IntelNUC9 kernel: intel_gt_handle_error+0x390/0x3b0 [i915]
Dec 24 11:52:37 IntelNUC9 kernel: ? set_next_task_idle+0x36/0x80
Dec 24 11:52:37 IntelNUC9 kernel: ? finish_task_switch.isra.0+0x8f/0x2a0
Dec 24 11:52:37 IntelNUC9 kernel: heartbeat+0x3bc/0x3d0 [i915]
Dec 24 11:52:37 IntelNUC9 kernel: process_one_work+0x168/0x370
Dec 24 11:52:37 IntelNUC9 kernel: worker_thread+0x2ea/0x420
Dec 24 11:52:37 IntelNUC9 kernel: ? rescuer_thread+0x4c0/0x4c0
Dec 24 11:52:37 IntelNUC9 kernel: kthread+0xcb/0x100
Dec 24 11:52:37 IntelNUC9 kernel: ? kthread_park+0x80/0x80
Dec 24 11:52:37 IntelNUC9 kernel: ret_from_fork+0x2d/0x50
Dec 24 11:52:37 IntelNUC9 kernel: ? kthread_park+0x80/0x80
Dec 24 11:52:37 IntelNUC9 kernel: ret_from_fork_asm+0x11/0x20
Dec 24 11:52:37 IntelNUC9 kernel: </TASK>
Dec 24 11:52:37 IntelNUC9 kernel: Modules linked in: ccm rfcomm snd_hrtimer snd_seq_dummy snd_seq_midi snd_seq_oss snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device cmac algif_hash algif_skcipher af_alg bnep zram zsmalloc lz4hc_compress uvcvideo videobuf2_vmalloc uvc videobuf2_memops videobuf2_v4l2 btusb videobuf2_common btrtl btintel btbcm videodev btmtk mc bluetooth binfmt_misc nls_iso8859_1 intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common snd_sof_pci_intel_cnl snd_sof_intel_hda_generic soundwire_intel soundwire_cadence snd_sof_intel_hda_common snd_soc_hdac_hda snd_sof_intel_hda_mlink snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_sof_utils snd_soc_acpi_intel_match soundwire_generic_allocation snd_soc_acpi soundwire_bus snd_soc_avs snd_hda_codec_realtek snd_soc_hda_codec snd_hda_codec_generic snd_hda_ext_core snd_hda_scodec_component snd_soc_core snd_hda_codec_hdmi snd_compress iwlmvm ac97_bus snd_pcm_dmaengine intel_tcc_cooling snd_hda_intel x86_pkg_temp_thermal intel_powerclamp
Dec 24 11:52:37 IntelNUC9 kernel: snd_intel_dspcfg coretemp snd_intel_sdw_acpi mac80211 snd_hda_codec kvm_intel snd_hda_core libarc4 mei_pxp mei_hdcp snd_pcsp snd_hwdep kvm cmdlinepart snd_pcm_oss iwlwifi asus_wmi snd_mixer_oss spi_nor rapl qc71_laptop(OE) platform_profile intel_cstate sparse_keymap intel_wmi_thunderbolt wmi_bmof ee1004 snd_pcm ucsi_ccg iTCO_wdt mtd cfg80211 typec_ucsi intel_pmc_bxt snd_timer mei_me 8250_dw iTCO_vendor_support typec nvidiafb snd vgastate mei soundcore fb_ddc intel_pch_thermal intel_pmc_core intel_vsec pmt_telemetry acpi_tad acpi_pad pmt_class input_leds joydev mac_hid serio_raw msr parport_pc ppdev lp parport efi_pstore dmi_sysfs ip_tables x_tables autofs4 btrfs blake2b_generic xor raid6_pq libcrc32c usbkbd usbmouse usbhid i915 nouveau drm_gpuvm crct10dif_pclmul drm_exec crc32_pclmul gpu_sched drm_buddy polyval_clmulni r8169 i2c_algo_bit polyval_generic hid_multitouch drm_ttm_helper hid_generic ghash_clmulni_intel i2c_i801 realtek ttm nvme sha256_ssse3 i2c_mux mdio_devres spi_intel_pci uas sha1_ssse3
Dec 24 11:52:37 IntelNUC9 kernel: psmouse mxm_wmi i2c_smbus spi_intel nvme_core drm_display_helper thunderbolt i2c_hid_acpi libphy intel_lpss_pci ahci usb_storage cec nvme_auth intel_lpss i2c_hid i2c_nvidia_gpu libahci idma64 rc_core i2c_ccgx_ucsi hid video wmi pinctrl_cannonlake aesni_intel crypto_simd cryptd
Dec 24 11:52:37 IntelNUC9 kernel: CR2: 0000000000000280
Dec 24 11:52:37 IntelNUC9 kernel: ---[ end trace 0000000000000000 ]---
Dec 24 11:52:37 IntelNUC9 kernel: RIP: 0010:__i915_gpu_coredump+0x20b/0x7c0 [i915]
Dec 24 11:52:37 IntelNUC9 kernel: Code: 2b 46 28 89 44 24 08 e8 83 49 8a ea 8b 44 24 08 48 8b 74 24 28 85 c0 79 3a 48 8b 44 24 20 4c 8b 46 28 48 8d 55 18 48 8b 4e 20 <44> 0f b7 88 80 02 00 00 48 8b 45 08 48 8b 38 48 85 ff 74 04 48 8b
Dec 24 11:52:37 IntelNUC9 kernel: RSP: 0018:ffffabaf80bcfc80 EFLAGS: 00010286
Dec 24 11:52:37 IntelNUC9 kernel: RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000001058
Dec 24 11:52:37 IntelNUC9 kernel: RDX: ffff9a1a698ea018 RSI: ffff9a1c830d4b00 RDI: ffff9a1a507cb600
Dec 24 11:52:37 IntelNUC9 kernel: RBP: ffff9a1a698ea000 R08: 0000000000052854 R09: 00000000ffffffff
Dec 24 11:52:37 IntelNUC9 kernel: R10: 0000000000000000 R11: 000000000000e164 R12: 0000000000000000
Dec 24 11:52:37 IntelNUC9 kernel: R13: ffff9a1a61360000 R14: ffff9a1a5ea8a000 R15: ffff9a1a86187400
Dec 24 11:52:37 IntelNUC9 kernel: FS: 0000000000000000(0000) GS:ffff9a21dd880000(0000) knlGS:0000000000000000
Dec 24 11:52:37 IntelNUC9 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Dec 24 11:52:37 IntelNUC9 kernel: CR2: 0000000000000280 CR3: 00000001bfa1e001 CR4: 00000000003726f0
Dec 24 11:52:37 IntelNUC9 kernel: note: kworker/1:1H[234] exited with irqs disabled
Dec 24 11:52:42 IntelNUC9 kernel: Fence expiration time out i915-0000:00:02.0:zoom[4127]:52854!
Dec 24 11:52:42 IntelNUC9 kernel: Fence expiration time out i915-0000:00:02.0:zoom[4127]:52856!
Dec 24 11:52:43 IntelNUC9 kernel: rcu: INFO: rcu_preempt detected stalls on CPUs/tasks:
Dec 24 11:52:43 IntelNUC9 kernel: rcu: 3-...0: (1 GPs behind) idle=1404/1/0x4000000000000000 softirq=150735/150736 fqs=1353
Dec 24 11:52:43 IntelNUC9 kernel: rcu: (detected by 8, t=5256 jiffies, g=208369, q=2152 ncpus=12)
Dec 24 11:52:43 IntelNUC9 kernel: Sending NMI from CPU 8 to CPUs 3:
----------------------------------------