I am also seeing this on two systems, both running Ubuntu 24.04.1.

First system:
ASUS Pro WS TRX50-SAGE WIFI + AMD Threadripper 7970X
Ubuntu 24.04 Desktop + kernel 6.11.x from Xanmod
SSDs that have this issue: Solidigm P44 Pro 2TB and Samsung 990 Pro 4TB

Second system:
Gigabyte MZ32-AR0 (rev 1.0) + AMD Epyc 7302P
Ubuntu 24.04 Server + kernel 6.8-generic from stock repositories
SSDs that have this issue: SK hynix P41 Platinum 2TB (almost identical to the 
P44 Pro above, but with different firmware) and Samsung 990 Pro 4TB (a 
different unit from the one in the first system)

I tried nvme_core.default_ps_max_latency_us=0 on its own and combined with 
pcie_aspm=off pcie_port_pm=off.
It may be coincidence, but with all three parameters it survives a bit longer 
than without them. It still reliably crashes when running 
`btrfs scrub start /`.
Attaching logs from the second system, which was booted with 
`nvme_core.default_ps_max_latency_us=0 pcie_aspm=off pcie_port_pm=off`.
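
For anyone comparing setups, here is a minimal sketch for confirming that the 
three parameters actually reached the running kernel. The helper name 
check_params is made up for illustration; it only does a string match against 
the kernel command line and says nothing about whether the parameters had any 
effect.

```shell
#!/bin/sh
# check_params (hypothetical helper): report which of the three workaround
# parameters appear on the kernel command line passed as the first argument.
check_params() {
    cmdline="$1"
    for param in nvme_core.default_ps_max_latency_us=0 pcie_aspm=off pcie_port_pm=off; do
        # Pad with spaces so only whole, space-separated parameters match.
        case " $cmdline " in
            *" $param "*) echo "present: $param" ;;
            *)            echo "missing: $param" ;;
        esac
    done
}

# On a running Linux system, check the command line the kernel was booted with.
if [ -r /proc/cmdline ]; then
    check_params "$(cat /proc/cmdline)"
fi
```

The value the NVMe driver ended up with should also be readable from 
/sys/module/nvme_core/parameters/default_ps_max_latency_us, assuming nvme_core 
is loaded.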

Things I have tried so far without any luck:
* moving the SSD to a different PCIe slot (both a native M.2 slot on the 
motherboard and a regular PCIe slot through an adapter)
* forcing PCIe 3.0 speed for these PCIe 4.0 SSDs
* various BIOS options (like enabling/disabling AER, ACS, some others)
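
To double-check that a forced PCIe 3.0 setting took effect, the negotiated 
link speed can be read from sysfs. This is a rough sketch, assuming the 
controllers show up under /sys/class/nvme and are PCIe-attached; show_link is 
a made-up helper name. A current_link_speed of 8.0 GT/s corresponds to PCIe 
3.0 and 16.0 GT/s to PCIe 4.0.

```shell
#!/bin/sh
# show_link (hypothetical helper): print the PCIe link attributes found in the
# sysfs directory of a PCI device, skipping any attribute that is not present.
show_link() {
    dev="$1"
    for attr in current_link_speed max_link_speed current_link_width max_link_width; do
        if [ -r "$dev/$attr" ]; then
            printf '%s: %s\n' "$attr" "$(cat "$dev/$attr")"
        fi
    done
}

# Walk all NVMe controllers; "device" links to the underlying PCI device.
for ctrl in /sys/class/nvme/nvme*; do
    [ -e "$ctrl/device" ] || continue
    echo "== ${ctrl##*/}"
    show_link "$ctrl/device"
done
```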

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1910866

Title:
  nvme drive fails after some time
