Package: linux-image
Version: 6.11.10-amd64

On a Dell Precision 3650 Tower (Firmware Version: 1.35.0), we have two Samsung SSD 990 PRO with Heatsink 4TB drives, running different firmware versions, that shut down randomly with the following messages:
```
Dec 30 08:05:11 [redacted] kernel: nvme nvme1: Disabling device after reset failure: -19
Dec 30 08:05:11 [redacted] kernel: nvme 0000:02:00.0: Unable to change power state from D3cold to D0, device inaccessible
Dec 30 08:05:11 [redacted] kernel: nvme nvme1: Try "nvme_core.default_ps_max_latency_us=0 pcie_aspm=off pcie_port_pm=off" and report a bug
Dec 30 08:05:11 [redacted] kernel: nvme nvme1: Does your device have a faulty power saving mode enabled?
Dec 30 08:05:11 [redacted] kernel: nvme nvme1: controller is down; will reset: CSTS=0xffffffff, PCI_STATUS=0xffff
```

The device is thereafter unresponsive to any command issued with the `nvme` tool. Booting the system with the suggested kernel parameters yields no change (the log snippet above was taken from a system booted with those parameters).
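For anyone trying to reproduce this, here is a minimal sketch of one way to confirm that the three suggested parameters are really present on the running kernel's command line. The helper name `check_nvme_params` is hypothetical and not part of any tool; it only does a whole-word match on the string it is given.

```sh
#!/bin/sh
# Hypothetical helper: report which of the parameters suggested by the
# nvme driver message appear in a given kernel command line string.
check_nvme_params() {
    cmdline=$1
    for param in nvme_core.default_ps_max_latency_us=0 pcie_aspm=off pcie_port_pm=off; do
        # Pad with spaces so only whole, space-separated arguments match.
        case " $cmdline " in
            *" $param "*) echo "present: $param" ;;
            *)            echo "missing: $param" ;;
        esac
    done
}

# Intended use on the affected machine:
#   check_nvme_params "$(cat /proc/cmdline)"
# The value actually applied by the nvme_core module can also be read from:
#   cat /sys/module/nvme_core/parameters/default_ps_max_latency_us
```

A parameter showing as present here only means it reached the kernel command line; it does not by itself show that the firmware or the drive honoured the setting.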
Disks as described by `lshw`:

```
[...]
*-nvme
     description: NVMe device
     product: Samsung SSD 990 PRO with Heatsink 4TB
     vendor: Samsung Electronics Co Ltd
     physical id: 0
     bus info: pci@0000:01:00.0
     logical name: /dev/nvme0
     version: 4B2QJXD7
     serial: S7HRNJ0X201019H
     width: 64 bits
     clock: 33MHz
     capabilities: nvme pm msi pciexpress msix nvm_express bus_master cap_list
     configuration: driver=nvme latency=0 nqn=nqn.1994-11.com.samsung:nvme:990PRO:M.2:S7HRNJ0X201019H state=live
     resources: irq:16 memory:71400000-71403fff
   *-namespace:0
        description: NVMe disk
        physical id: 0
        logical name: hwmon0
   *-namespace:1
        description: NVMe disk
        physical id: 2
        logical name: /dev/ng0n1
   *-namespace:2
        description: NVMe disk
        physical id: 1
        bus info: nvme@0:1
        logical name: /dev/nvme0n1
        serial: [redacted]
        size: 3726GiB
        capacity: 3726GiB
        capabilities: lvm2
        configuration: logicalsectorsize=512 sectorsize=512 wwid=eui.0025384241403599
[...]
*-generic
     description: NVMe device
     product: Samsung SSD 990 PRO with Heatsink 4TB
     vendor: Samsung Electronics Co Ltd
     physical id: 0
     bus info: pci@0000:02:00.0
     logical name: /dev/nvme1
     version: 0B2QJXG7
     serial: S7HRNJ0WC05772H
     width: 32 bits
     clock: 66MHz
     capabilities: bus_master vga_palette cap_list
     configuration: driver=nvme latency=255 maxlatency=255 mingnt=255 nqn=nqn.1994-11.com.samsung:nvme:990PRO:M.2:S7HRNJ0WC05772H state=dead
     resources: irq:16 memory:71300000-71303fff
   *-namespace:0
        description: NVMe disk
        physical id: 0
        logical name: hwmon1
   *-namespace:1
        description: NVMe disk
        physical id: 2
        logical name: /dev/ng1n1
   *-namespace:2
        description: NVMe disk
        physical id: 1
        bus info: nvme@1:1
        logical name: /dev/nvme1n1
        configuration: logicalsectorsize=512 sectorsize=512 wwid=eui.0025384c3144c366
[...]
```

-- 
David LAROCHETTE
Move Solutions / Technical Department
Developer, systems and network administrator