The warning occurs for both TX and RX on Ubuntu 24.04.2 with kernel
6.11.0-21-generic and with the mainline kernel build 6.14.2-061402-generic.
It appears to be related to hangs at reboot and poweroff.

The TX and RX warnings appear:
- On suspend (s2idle)
- On reboot/poweroff
- After boot (if the nvidia driver is blacklisted)
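
To check which of these situations actually produced the warning, the
messages can be counted in a saved log. This is only a minimal sketch: the
function name is made up for illustration, and it assumes the RX message
uses the same wording as the TX message quoted in the bug description.

```shell
#!/bin/sh
# Count "interrupt for TX/RX ring N ..." warnings in a kernel log read
# from stdin, grouped by direction. count_ring_warnings is a hypothetical
# helper name, not part of any existing tool.
count_ring_warnings() {
    # Match only the start of the message so that logs wrapped by a
    # mailer (where "enabled" lands on the next line) still count.
    grep -Eo 'interrupt for (TX|RX) ring [0-9]+' |
        awk '{ n[$3]++ } END { printf "TX=%d RX=%d\n", n["TX"], n["RX"] }'
}
```

Typical use would be "dmesg | count_ring_warnings" on the running system, or
"count_ring_warnings < shutdown-dmesg.txt" on a captured log.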

Tests:
nvidia blacklisted => warnings after boot, reboot hangs
nvidia blacklisted + immediate reboot => does not hang
nvidia not blacklisted + reboot => does not hang
nvidia not blacklisted + suspend => warnings after suspend, reboot hangs
thunderbolt blacklisted => does not hang, with or without suspend
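
For anyone trying to reproduce the matrix above: the comment does not say
how the modules were blacklisted, so the following is only one possible
way, using standard modprobe.d syntax. The helper name and the file name in
the usage comment are hypothetical.

```shell
#!/bin/sh
# Print a modprobe.d snippet for one module.
# "blacklist" stops alias-based autoloading; the "install ... /bin/false"
# line additionally makes an explicit "modprobe <module>" fail.
# blacklist_conf is a hypothetical helper name.
blacklist_conf() {
    printf 'blacklist %s\ninstall %s /bin/false\n' "$1" "$1"
}

# Example (as root; the file name is an assumption):
#   blacklist_conf thunderbolt > /etc/modprobe.d/blacklist-thunderbolt.conf
#   update-initramfs -u    # so the initramfs copy of modprobe.d is updated
```

Removing the file and regenerating the initramfs should restore the
default behaviour.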

The abnormal reboot proceeds as follows:
1. The same TX and RX warnings again
2. NX-protected page execution OR kernel NULL pointer dereference
3. Recursive fault (scheduling while atomic)
4. The system hangs with the cursor blinking; it does not respond to SysRq
   and shows no WDT warning, but it does not panic either.

Kernel logs captured via netconsole are attached below. The tails of the
logs are truncated depending on when r8168 gets unloaded, but on screen
the output always ends with the same recursive fault as in
shutdown-dmesg.txt.
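
For reference, a netconsole capture like this can be set up roughly as
follows. This is a sketch, not the setup actually used here: the interface
name, receiver address and helper name are assumptions, and the parameter
string follows the format in the kernel's netconsole documentation with the
source port, source IP and target MAC left at their defaults.

```shell
#!/bin/sh
# Build a netconsole= module parameter.
#   $1 = local network interface
#   $2 = receiver IP address
#   $3 = receiver UDP port (optional, defaults to 6666)
# netconsole_param is a hypothetical helper name.
netconsole_param() {
    printf 'netconsole=@/%s,%s@%s/\n' "$1" "${3:-6666}" "$2"
}

# Example (as root, on the affected machine; values are placeholders):
#   modprobe netconsole "$(netconsole_param eth0 192.0.2.10)"
# The receiving machine then listens for UDP on that port, for instance
# with netcat (option syntax differs between netcat variants).
```

Because the messages leave through the network driver, the capture stops
once that driver is unloaded, which matches the truncated tails noted above.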

** Attachment added: "shutdown-dmesg.txt"
   https://bugs.launchpad.net/ubuntu/+source/linux-signed/+bug/2052411/+attachment/5871067/+files/shutdown-dmesg.txt

https://bugs.launchpad.net/bugs/2052411

Title:
  thunderbolt: interrupt for TX ring 0 is already enabled

Status in linux-signed package in Ubuntu:
  Confirmed

Bug description:
  After upgrading to linux-image-6.5.0-15-generic 6.5.0-15.15~22.04.1 I
  lost use of an HID device via thunderbolt (still working via USB).
  This coincided with the following dmesg trace.

  [26360.900981] ------------[ cut here ]------------
  [26360.900984] thunderbolt 0000:22:00.0: interrupt for TX ring 0 is already enabled
  [26360.901065] WARNING: CPU: 19 PID: 263006 at drivers/thunderbolt/nhi.c:146 ring_interrupt_active+0x270/0x350 [thunderbolt]
  [26360.901108] Modules linked in: rfcomm vboxnetadp(OE) vboxnetflt(OE) 
vboxdrv(OE) xfrm_user xfrm_algo xt_addrtype br_netfilter ccm xt_CHECKSUM 
xt_MASQUERADE xt_conntrack snd_ctl_led snd_soc_skl_hda_dsp 
snd_soc_intel_hda_dsp_common snd_sof_probes snd_soc_hdac_hdmi cmac udp_diag 
ipt_REJECT nf_reject_ipv4 algif_hash snd_hda_codec_realtek nf_conntrack_netlink 
xt_tcpudp nft_ct snd_hda_codec_generic algif_skcipher nvidia_uvm(PO) nft_compat 
nft_masq nft_queue af_alg nft_chain_nat nf_nat_h323 nf_conntrack_h323 
nf_nat_pptp nf_conntrack_pptp nf_nat_tftp nf_conntrack_tftp nf_nat_sip 
nf_conntrack_sip nf_nat_irc nf_conntrack_irc nf_nat_ftp nf_conntrack_ftp 
iptable_nat nf_nat nf_tables nf_conntrack nfnetlink_queue nf_defrag_ipv6 
nf_defrag_ipv4 libcrc32c bridge stp llc nfnetlink tcp_diag inet_diag overlay 
bnep snd_soc_dmic snd_sof_pci_intel_tgl snd_sof_intel_hda_common 
soundwire_intel snd_sof_intel_hda_mlink soundwire_cadence snd_sof_intel_hda 
snd_sof_pci snd_sof_xtensa_dsp snd_sof intel_uncore_frequency
intel_uncore_frequency_common
  [26360.901169]  intel_tcc_cooling snd_sof_utils snd_soc_hdac_hda 
snd_hda_ext_core x86_pkg_temp_thermal intel_powerclamp snd_soc_acpi_intel_match 
coretemp snd_soc_acpi soundwire_generic_allocation soundwire_bus nvidia_drm(PO) 
iwlmvm kvm_intel snd_soc_core snd_compress nvidia_modeset(PO) i915 
snd_hda_codec_hdmi ac97_bus mac80211 snd_pcm_dmaengine kvm btusb 
crct10dif_pclmul polyval_clmulni btrtl polyval_generic uvcvideo 
ghash_clmulni_intel drm_buddy btbcm aesni_intel snd_hda_intel videobuf2_vmalloc 
hid_sensor_als btintel snd_hda_scodec_cs35l41_spi uvc ttm snd_intel_dspcfg 
crypto_simd btmtk videobuf2_memops mei_hdcp mei_pxp intel_rapl_msr libarc4 
snd_intel_sdw_acpi cryptd videobuf2_v4l2 nvidia(PO) binfmt_misc irqbypass 
snd_usb_audio hid_sensor_trigger bluetooth iwlwifi snd_hda_codec thinkpad_acpi 
industrialio_triggered_buffer videodev drm_display_helper spi_nor kfifo_buf 
snd_usbmidi_lib snd_seq_midi snd_hda_core videobuf2_common rapl 
hid_sensor_iio_common snd_ump hid_magicmouse think_lmi
snd_hda_scodec_cs35l41_i2c
  [26360.901228]  snd_seq_midi_event ecdh_generic intel_cstate nls_iso8859_1 
serio_raw cec snd_hda_scodec_cs35l41 processor_thermal_device_pci industrialio 
firmware_attributes_class wmi_bmof input_leds joydev apple_mfi_fastcharge mc 
snd_rawmidi mtd nvram ecc snd_hwdep rc_core mei_me processor_thermal_device 
snd_hda_cs_dsp_ctls cfg80211 snd_pcm snd_seq cs_dsp hid_multitouch 
processor_thermal_rfim mei snd_soc_cs35l41_lib processor_thermal_mbox 
drm_kms_helper snd_seq_device processor_thermal_rapl i2c_algo_bit snd_timer 
intel_rapl_common mac_hid snd soundcore serial_multi_instantiate ledtrig_audio 
int3403_thermal platform_profile int340x_thermal_zone int3400_thermal 
acpi_thermal_rel acpi_pad acpi_tad sch_fq_codel msr parport_pc ppdev drm lp 
parport efi_pstore ip_tables x_tables autofs4 hid_sensor_hub cdc_ncm cdc_ether 
usbnet r8152 mii usbhid hid_generic rtsx_pci_sdmmc i2c_i801 nvme spi_intel_pci 
crc32_pclmul psmouse thunderbolt i2c_smbus spi_intel nvme_core ucsi_acpi 
intel_lpss_pci rtsx_pci typec_ucsi intel_lpss xhci_pci
  [26360.901294]  nvme_common idma64 xhci_pci_renesas i2c_hid_acpi typec i2c_hid hid video pinctrl_tigerlake wmi
  [26360.901305] CPU: 19 PID: 263006 Comm: kworker/19:2 Tainted: P           OE      6.5.0-15-generic #15~22.04.1-Ubuntu
  [26360.901309] Hardware name: LENOVO 21FV003CAU/21FV003CAU, BIOS N3ZET21W (1.08 ) 07/07/2023
  [26360.901311] Workqueue: pm pm_runtime_work
  [26360.901317] RIP: 0010:ring_interrupt_active+0x270/0x350 [thunderbolt]
  [26360.901348] Code: 89 5d c8 44 89 45 d4 e8 8e da 38 cc 44 8b 45 d4 48 8b 4d c0 49 89 d9 48 8b 55 b8 48 89 c6 48 c7 c7 78 70 5e c0 e8 50 7e 97 cb <0f> 0b 4d 8b 7f 08 44 8b 5d c8 49 8b 77 18 45 84 ed 0f 85 d9 fe ff
  [26360.901351] RSP: 0018:ffffb091a6213c48 EFLAGS: 00010046
  [26360.901354] RAX: 0000000000000000 RBX: ffffffffc05eb6db RCX: 0000000000000000
  [26360.901357] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
  [26360.901358] RBP: ffffb091a6213c98 R08: 0000000000000000 R09: 0000000000000000
  [26360.901360] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000001
  [26360.901361] R13: 0000000000000001 R14: 0000000000038200 R15: ffff9763017d0240
  [26360.901363] FS:  0000000000000000(0000) GS:ffff976a3f6c0000(0000) knlGS:0000000000000000
  [26360.901366] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
  [26360.901368] CR2: 00007ffc979f4d00 CR3: 000000010fd60000 CR4: 0000000000752ee0
  [26360.901370] PKRU: 55555554
  [26360.901372] Call Trace:
  [26360.901374]  <TASK>
  [26360.901379]  ? show_regs+0x6d/0x80
  [26360.901386]  ? __warn+0x89/0x160
  [26360.901392]  ? ring_interrupt_active+0x270/0x350 [thunderbolt]
  [26360.901424]  ? report_bug+0x17e/0x1b0
  [26360.901429]  ? handle_bug+0x46/0x90
  [26360.901435]  ? exc_invalid_op+0x18/0x80
  [26360.901439]  ? asm_exc_invalid_op+0x1b/0x20
  [26360.901447]  ? ring_interrupt_active+0x270/0x350 [thunderbolt]
  [26360.901478]  ? ring_interrupt_active+0x270/0x350 [thunderbolt]
  [26360.901509]  ? __pfx_pci_pm_runtime_resume+0x10/0x10
  [26360.901514]  tb_ring_start+0x17e/0x330 [thunderbolt]
  [26360.901545]  tb_ctl_start+0x1d/0xc0 [thunderbolt]
  [26360.901577]  tb_domain_runtime_resume+0x19/0x50 [thunderbolt]
  [26360.901620]  nhi_runtime_resume+0x64/0x80 [thunderbolt]
  [26360.901651]  pci_pm_runtime_resume+0xa0/0xf0
  [26360.901655]  __rpm_callback+0x4d/0x170
  [26360.901660]  rpm_callback+0x6d/0x80
  [26360.901665]  ? __pfx_pci_pm_runtime_resume+0x10/0x10
  [26360.901669]  rpm_resume+0x594/0x7e0
  [26360.901673]  ? __pfx_pm_runtime_work+0x10/0x10
  [26360.901676]  pm_runtime_work+0x80/0xe0
  [26360.901679]  process_one_work+0x23d/0x450
  [26360.901684]  worker_thread+0x50/0x3f0
  [26360.901687]  ? __pfx_worker_thread+0x10/0x10
  [26360.901691]  kthread+0xef/0x120
  [26360.901695]  ? __pfx_kthread+0x10/0x10
  [26360.901700]  ret_from_fork+0x44/0x70
  [26360.901705]  ? __pfx_kthread+0x10/0x10
  [26360.901709]  ret_from_fork_asm+0x1b/0x30
  [26360.901714]  </TASK>
  [26360.901715] ---[ end trace 0000000000000000 ]---

  I found references to this message in
  https://bugzilla.redhat.com/show_bug.cgi?id=1703369, but that report
  claims the fix is in 5.0.14, and in
  https://bugzilla.kernel.org/show_bug.cgi?id=217343, which claims the
  fix is in 6.3.9.

  
  Other details:

  $ uname -r                            
  6.5.0-15-generic
  $ lsb_release -a
  No LSB modules are available.
  Distributor ID:       Ubuntu
  Description:  Ubuntu 22.04.3 LTS
  Release:      22.04
  Codename:     jammy
  $ dpkg -l linux-image-6.5.0-15-generic 
  Desired=Unknown/Install/Remove/Purge/Hold
  | Status=Not/Inst/Conf-files/Unpacked/halF-conf/Half-inst/trig-aWait/Trig-pend
  |/ Err?=(none)/Reinst-required (Status,Err: uppercase=bad)
  ||/ Name                         Version             Architecture Description
  +++-============================-===================-============-=================================
  ii  linux-image-6.5.0-15-generic 6.5.0-15.15~22.04.1 amd64        Signed kernel image generic
