(In reply to Sean Kennedy from comment #205)
> I also have the issue.  Using Proxmox 7.2 (Debian Bullseye) with a Lenovo
> M910q core-i7-7700T,  using two TPLink UE300 (RTL8153) USB to 1Gbe Ethernet
> adapters. Each one is stable in a lower USB slot. Swapping the adapters does
> not change the behavior and only impacts the USB device in the higher slot. 
> Changes to different ports without change.

Update - Tried a different dongle - a 2.5Gbe and have two hard drives
attached to the system.  Doesn't matter where the 2.5Gbe dongle is
attached,  it eventually errors with "WARN Set TR Deq Ptr cmd failed"
And the error rate is only around six times a day right now:

8156 Realtek Semiconductor Corp. USB 10/100/1G/2.5G LAN

# dmesg -T | grep xhci
[Tue Sep  6 13:37:13 2022] xhci_hcd 0000:00:14.0: xHCI Host Controller
[Tue Sep  6 13:37:13 2022] xhci_hcd 0000:00:14.0: new USB bus registered, 
assigned bus number 1
[Tue Sep  6 13:37:13 2022] xhci_hcd 0000:00:14.0: hcc params 0x200077c1 hci 
version 0x100 quirks 0x0000000000009810
[Tue Sep  6 13:37:13 2022] usb usb1: Manufacturer: Linux 5.15.39-4-pve xhci-hcd
[Tue Sep  6 13:37:13 2022] xhci_hcd 0000:00:14.0: xHCI Host Controller
[Tue Sep  6 13:37:13 2022] xhci_hcd 0000:00:14.0: new USB bus registered, 
assigned bus number 2
[Tue Sep  6 13:37:13 2022] xhci_hcd 0000:00:14.0: Host supports USB 3.0 
SuperSpeed
[Tue Sep  6 13:37:13 2022] usb usb2: Manufacturer: Linux 5.15.39-4-pve xhci-hcd
[Tue Sep  6 13:37:13 2022] usb 2-1: new SuperSpeed USB device number 2 using 
xhci_hcd
[Tue Sep  6 13:37:14 2022] usb 2-3: new SuperSpeed USB device number 3 using 
xhci_hcd
[Tue Sep  6 13:37:14 2022] usb 2-4: new SuperSpeed USB device number 4 using 
xhci_hcd
[Tue Sep  6 14:39:22 2022] xhci_hcd 0000:00:14.0: WARN Set TR Deq Ptr cmd 
failed due to incorrect slot or ep state.
[Tue Sep  6 14:39:22 2022] usb 2-4: new SuperSpeed USB device number 5 using 
xhci_hcd
[Tue Sep  6 18:44:01 2022] xhci_hcd 0000:00:14.0: WARN Set TR Deq Ptr cmd 
failed due to incorrect slot or ep state.
[Tue Sep  6 18:44:01 2022] xhci_hcd 0000:00:14.0: WARN Set TR Deq Ptr cmd 
failed due to incorrect slot or ep state.
[Tue Sep  6 18:44:02 2022] usb 2-4: new SuperSpeed USB device number 6 using 
xhci_hcd
[Tue Sep  6 22:19:06 2022] xhci_hcd 0000:00:14.0: WARN Set TR Deq Ptr cmd 
failed due to incorrect slot or ep state.
[Tue Sep  6 22:19:07 2022] usb 2-4: new SuperSpeed USB device number 7 using 
xhci_hcd


Since this drops the device from the system and offlines the link,  I created a 
simple script to detect zero UP ethernet devices via cron once a minute and 
runs a ifnet -a.   It's clunky but works.  

crontab:
# m h  dom mon dow   command
* * * * * /root/fixnet.sh >/dev/null 2>&1

fixnet.sh:
#!/bin/sh

STATE=`ip link | grep " enx" | grep UP | wc -l`
if [ $STATE -gt 0 ]; then
  # All good.  Exit
  exit 0
fi

/usr/sbin/ifup -a
sleep 20

ping -c 1 10.0.0.1 | grep "1 received"
if [ $? -eq 0 ]; then
  # Network looks good. Exit.
  exit 0
fi

sleep 310
ping -c 1 10.0.0.1 | grep "1 received"
if [ $? -ne 0 ]; then
  # The network is still down.
  systemctl reboot
fi

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1749961

Title:
  xhci_hcd: TRB DMA errors reported with ASMedia ASM1142 USB 3.1
  Controller

Status in Linux:
  Confirmed
Status in linux package in Ubuntu:
  In Progress
Status in linux source package in Trusty:
  Won't Fix
Status in linux source package in Xenial:
  Confirmed
Status in linux source package in Bionic:
  Confirmed
Status in linux source package in Focal:
  Confirmed
Status in linux package in Debian:
  Confirmed

Bug description:
  It was observed that while trying to use a 4K USB webcam connected to
  USB port provided by ASMedia ASM1142 USB 3.1 Controller, the webcam
  does not work and kernel log shows the following messages:

  [431.928016] xhci_hcd 0000:12:00.0: ERROR Transfer event TRB DMA ptr not part 
of current TD ep_index 2 comp_code 13
  [431.928021] xhci_hcd 0000:12:00.0: Looking for event-dma 0000003f3330e020 
trb-start 0000003f3330e000 trb-end 0000003f3330e000 seg-start 0000003f3330e000 
seg-end 0000003f3330eff0
  [431.928024] xhci_hcd 0000:12:00.0: ERROR Transfer event TRB DMA ptr not part 
of current TD ep_index 2 comp_code 13
  [431.928026] xhci_hcd 0000:12:00.0: Looking for event-dma 0000003f3330e030 
trb-start 0000003f3330e000 trb-end 0000003f3330e000 seg-start 0000003f3330e000 
seg-end 0000003f3330eff0
  [431.928027] xhci_hcd 0000:12:00.0: ERROR Transfer event TRB DMA ptr not part 
of current TD ep_index 2 comp_code 13
  [431.928029] xhci_hcd 0000:12:00.0: Looking for event-dma 0000003f3330e050 
trb-start 0000003f3330e000 trb-end 0000003f3330e000 seg-start 0000003f3330e000 
seg-end 0000003f3330eff0
  [431.928386] xhci_hcd 0000:12:00.0: ERROR Transfer event TRB DMA ptr not part 
of current TD ep_index 2 comp_code 13

  A similar issue was already reported on Launchpad:
  https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1667750

  The fix to this issue seems to be the following patch:
  
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=9da5a109

  Tests in our scenario with this patch proved still broken. Our next
  approach is to modify the patch a bit and re-test.

  This LP will be used to document our progress in the investigation.

To manage notifications about this bug go to:
https://bugs.launchpad.net/linux/+bug/1749961/+subscriptions


-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to