Public bug reported:

We have some severs experiencing a loss of network connection after some
weeks of uptime.

PCI:
43:00.0 Ethernet controller: Broadcom Inc. and subsidiaries BCM57414 
NetXtreme-E 10Gb/25Gb RDMA Ethernet Controller (rev 01)
43:00.1 Ethernet controller: Broadcom Inc. and subsidiaries BCM57414 
NetXtreme-E 10Gb/25Gb RDMA Ethernet Controller (rev 01)

Following the kernel log, one interface is failing, and a few hours
later the second interface fails too:

[Tue Dec 16 10:53:11 2025] DMAR: DRHD: handling fault status reg 2
[Tue Dec 16 10:53:11 2025] DMAR: [DMA Read NO_PASID] Request device [43:00.0] 
fault addr 0xfc5a9000 [fault reason 0x71] SM: Present bit in first-level paging 
entry is clear
[Tue Dec 16 10:53:11 2025] bnxt_en 0000:43:00.1 ens1f1np1: Fatal firmware reset 
event, data1: 0x201, data2: 0xda27, min wait 1300 ms, max wait 4200 ms
[Tue Dec 16 10:53:11 2025] bnxt_en 0000:43:00.1 ens1f1np1: 
hwrm_tunnel_dst_port_free failed. rc:-16
[Tue Dec 16 10:53:11 2025] bond0: (slave ens1f1np1): link status definitely 
down, disabling slave
[Tue Dec 16 10:53:11 2025] bnxt_en 0000:43:00.0 ens1f0np0: hwrm req_type 0x23 
seq id 0x73cc error 0xf
[Tue Dec 16 10:53:12 2025] bnxt_en 0000:43:00.0 ens1f0np0: hwrm req_type 0xb4 
seq id 0x73cd error 0xf
[Tue Dec 16 10:53:12 2025] bnxt_en 0000:43:00.1 ens1f1np1: Device requests max 
timeout of 100 seconds, may trigger hung task watchdog
[Tue Dec 16 10:53:12 2025] bnxt_en 0000:43:00.0 ens1f0np0: Abandoning msg {0x23 
0x73ce} len: 0 due to firmware status: 0x2000001
[Tue Dec 16 10:53:12 2025] bond0: (slave ens1f1np1): link status definitely up, 
25000 Mbps full duplex
[Tue Dec 16 10:53:13 2025] bnxt_en 0000:43:00.0 ens1f0np0: Abandoning msg {0xb4 
0x73cf} len: 0 due to firmware status: 0x2000001
[Tue Dec 16 10:53:13 2025] bnxt_en 0000:43:00.0 ens1f0np0: Abandoning msg {0x23 
0x73d0} len: 0 due to firmware status: 0x2000001
[Tue Dec 16 10:53:14 2025] bnxt_en 0000:43:00.0 ens1f0np0: Abandoning msg {0xb4 
0x73d1} len: 0 due to firmware status: 0x2000001
...
[Tue Dec 16 10:53:25 2025] bnxt_en 0000:43:00.0 ens1f0np0: NETDEV WATCHDOG: 
CPU: 39: transmit queue 5 timed out 5301 ms
[Tue Dec 16 10:53:25 2025] bnxt_en 0000:43:00.0 ens1f0np0: TX timeout detected, 
starting reset task!
...
[Tue Dec 16 13:17:32 2025] DMAR: DRHD: handling fault status reg 2
[Tue Dec 16 13:17:33 2025] DMAR: [DMA Read NO_PASID] Request device [43:00.1] 
fault addr 0xfde48000 [fault reason 0x71] SM: Present bit in first-level paging 
entry is clear
[Tue Dec 16 13:17:33 2025] bnxt_en 0000:43:00.1 ens1f1np1: hwrm req_type 0x23 
seq id 0xdcfe error 0xf
[Tue Dec 16 13:17:33 2025] bnxt_en 0000:43:00.1 ens1f1np1: hwrm req_type 0xb4 
seq id 0xdcff error 0xf
[Tue Dec 16 13:17:35 2025] bnxt_en 0000:43:00.1 ens1f1np1: Abandoning msg {0xb4 
0xdd04} len: 0 due to firmware status: 0x2000001
[Tue Dec 16 13:17:36 2025] bnxt_en 0000:43:00.1 ens1f1np1: Abandoning msg {0xb4 
0xdd06} len: 0 due to firmware status: 0x2000001
...
[Tue Dec 16 13:17:45 2025] bnxt_en 0000:43:00.1 ens1f1np1: NETDEV WATCHDOG: 
CPU: 34: transmit queue 3 timed out 5066 ms
[Tue Dec 16 13:17:45 2025] bnxt_en 0000:43:00.1 ens1f1np1: TX timeout detected, 
starting reset task!


(See full logs attached)

ethtool -i ens1f1np1 
driver: bnxt_en
version: 6.8.0-85-generic
firmware-version: 230.0.157.0/pkg 230.1.116.0
expansion-rom-version: 
bus-info: 0000:43:00.1
supports-statistics: yes
supports-test: yes
supports-eeprom-access: yes
supports-register-dump: yes
supports-priv-flags: no


This happened independently for different severs, always after some weeks of 
uptime.

Is this a firmware issue or maybe a driver problem?

We also checked the temperature of the device which was okay.

** Affects: linux (Ubuntu)
     Importance: Undecided
         Status: New

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/2147011

Title:
  bnxt_en: BCM57414 NetXtreme-E: Crash (Fatal firmware reset) of both
  interfaces within hours

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2147011/+subscriptions


-- 
ubuntu-bugs mailing list
[email protected]
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to