*** This bug is a duplicate of bug 1678184 *** https://bugs.launchpad.net/bugs/1678184
I'm having the same problem with my Dell Precision 5510. As soon as the crash occurs the file system is mounted in read-only mode and a few seconds or minutes later the entire machine crashes. The problem occurs with the following kernels that I tested: - linux-image-4.10.0-20-generic - linux-image-4.10.0-19-generic - linux-image-4.11.0-041100rc7-generic_4.11.0-041100rc7.201704161731 Dell Inc. Precision 5510/08R8KJ, BIOS 01.01.19 01/25/2016 I configured my laptop to send the kernel messages via syslog to another machine. Below are the messages from the crash. *** Before the crash *** kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.110539] xhci_hcd 0000:0a:00.0: remove, state 1 kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.110543] usb usb4: USB disconnect, device number 1 kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.110544] usb 4-1: USB disconnect, device number 2 kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.202761] xhci_hcd 0000:0a:00.0: Host halt failed, -19 kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.202781] xhci_hcd 0000:0a:00.0: Host not accessible, reset failed. kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.202782] xhci_hcd 0000:0a:00.0: USB bus 4 deregistered kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.202787] xhci_hcd 0000:0a:00.0: remove, state 4 kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.202805] usb usb3: USB disconnect, device number 1 kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.203066] xhci_hcd 0000:0a:00.0: USB bus 3 deregistered kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.246781] pci_bus 0000:3e: busn_res: can not insert [bus 3e] under [bus 07-0a] (conflicts with (null) [bus 07-0a]) kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.246812] pci 0000:3e:00.0: [8086:15b5] type 00 class 0x0c0330 kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.246831] pci 0000:3e:00.0: reg 0x10: [mem 0xd9f00000-0xd9f0ffff] kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.247002] pci 0000:3e:00.0: supports D1 D2 kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.247003] pci 0000:3e:00.0: PME# supported from D0 D1 D2 D3hot D3cold kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.247161] pcieport 0000:07:02.0: PCI bridge to [bus 3e] kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.247169] pcieport 0000:07:02.0: bridge window [mem 0xd9f00000-0xd9ffffff] kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.247175] pci_bus 0000:3e: [bus 3e] partially hidden behind bridge 0000:07 [bus 07-0a] kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.247197] pci_bus 0000:07: Allocating resources kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.247577] xhci_hcd 0000:3e:00.0: xHCI Host Controller kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.247582] xhci_hcd 0000:3e:00.0: new USB bus registered, assigned bus number 3 kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.248805] xhci_hcd 0000:3e:00.0: hcc params 0x200077c1 hci version 0x110 quirks 0x00009810 kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249199] usb usb3: New USB device found, idVendor=1d6b, idProduct=0002 kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249200] usb usb3: New USB device strings: Mfr=3, Product=2, SerialNumber=1 kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249201] usb usb3: Product: xHCI Host Controller kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249202] usb usb3: Manufacturer: Linux 4.10.0-20-generic xhci-hcd kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249202] usb usb3: SerialNumber: 0000:3e:00.0 kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249373] hub 3-0:1.0: USB hub found kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249435] hub 3-0:1.0: 2 ports detected kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249603] xhci_hcd 0000:3e:00.0: xHCI Host Controller kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249605] xhci_hcd 0000:3e:00.0: new USB bus registered, assigned bus number 4 kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249799] usb usb4: New USB device found, idVendor=1d6b, idProduct=0003 kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249800] usb usb4: New USB device strings: Mfr=3, Product=2, SerialNumber=1 kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249801] usb usb4: Product: xHCI Host Controller kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249802] usb usb4: Manufacturer: Linux 4.10.0-20-generic xhci-hcd kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249802] usb usb4: SerialNumber: 0000:3e:00.0 kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.250032] hub 4-0:1.0: USB hub found kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.250053] hub 4-0:1.0: 2 ports detected kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.594431] usb 4-1: new SuperSpeed USB device number 2 using xhci_hcd kern.log.1:Apr 27 14:55:09 arnox kernel: [ 1118.614826] usb 4-1: New USB device found, idVendor=0bda, idProduct=8153 kern.log.1:Apr 27 14:55:09 arnox kernel: [ 1118.614840] usb 4-1: New USB device strings: Mfr=1, Product=2, SerialNumber=6 kern.log.1:Apr 27 14:55:09 arnox kernel: [ 1118.614841] usb 4-1: Product: USB 10/100/1000 LAN kern.log.1:Apr 27 14:55:09 arnox kernel: [ 1118.614841] usb 4-1: Manufacturer: Realtek kern.log.1:Apr 27 14:55:09 arnox kernel: [ 1118.614842] usb 4-1: SerialNumber: 000001000000 kern.log.1:Apr 27 14:55:09 arnox kernel: [ 1118.734678] usb 4-1: reset SuperSpeed USB device number 2 using xhci_hcd kern.log.1:Apr 27 14:55:09 arnox kernel: [ 1118.756975] r8152 4-1:1.0 (unnamed net_device) (uninitialized): Using pass-thru MAC addr f8:ca:b8:6a:33:d5 kern.log.1:Apr 27 14:55:09 arnox kernel: [ 1118.811163] r8152 4-1:1.0 eth0: v1.08.8 kern.log.1:Apr 27 14:55:09 arnox kernel: [ 1118.825557] r8152 4-1:1.0 enxf8cab86a33d5: renamed from eth0 kern.log.1:Apr 27 14:55:09 arnox kernel: [ 1118.859589] IPv6: ADDRCONF(NETDEV_UP): enxf8cab86a33d5: link is not ready kern.log.1:Apr 27 14:55:09 arnox kernel: [ 1118.888334] IPv6: ADDRCONF(NETDEV_UP): enxf8cab86a33d5: link is not ready kern.log.1:Apr 27 14:55:12 arnox kernel: [ 1122.202756] r8152 4-1:1.0 enxf8cab86a33d5: carrier on kern.log.1:Apr 27 14:55:12 arnox kernel: [ 1122.202763] IPv6: ADDRCONF(NETDEV_CHANGE): enxf8cab86a33d5: link becomes ready kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031508] CPU0: Core temperature above threshold, cpu clock throttled (total events = 1) kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031509] CPU4: Core temperature above threshold, cpu clock throttled (total events = 1) kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031510] CPU7: Package temperature above threshold, cpu clock throttled (total events = 1) kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031510] CPU3: Package temperature above threshold, cpu clock throttled (total events = 1) kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031513] CPU1: Package temperature above threshold, cpu clock throttled (total events = 1) kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031513] CPU5: Package temperature above threshold, cpu clock throttled (total events = 1) kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031514] CPU4: Package temperature above threshold, cpu clock throttled (total events = 1) kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031538] CPU0: Package temperature above threshold, cpu clock throttled (total events = 1) kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031540] mce: [Hardware Error]: Machine check events logged kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031541] CPU2: Package temperature above threshold, cpu clock throttled (total events = 1) kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031542] mce: [Hardware Error]: Machine check events logged kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031542] CPU6: Package temperature above threshold, cpu clock throttled (total events = 1) kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031568] mce: [Hardware Error]: CPU 4: Machine Check: 0 Bank 128: 0000000088030803 kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031569] mce: [Hardware Error]: TSC 357cca83e4d kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031584] mce: [Hardware Error]: PROCESSOR 0:506e3 TIME 1493297880 SOCKET 0 APIC 1 microcode 74 kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031587] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 128: 0000000088030803 kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031589] mce: [Hardware Error]: TSC 357cca97135 kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031611] mce: [Hardware Error]: PROCESSOR 0:506e3 TIME 1493297880 SOCKET 0 APIC 0 microcode 74 kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032481] CPU0: Core temperature/speed normal kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032482] CPU4: Core temperature/speed normal kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032482] CPU0: Package temperature/speed normal kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032483] CPU4: Package temperature/speed normal kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032489] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 128: 0000000088050802 kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032491] mce: [Hardware Error]: TSC 357ccd1e67f kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032492] mce: [Hardware Error]: PROCESSOR 0:506e3 TIME 1493297880 SOCKET 0 APIC 0 microcode 74 kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032493] mce: [Hardware Error]: CPU 4: Machine Check: 0 Bank 128: 0000000088050802 kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032516] mce: [Hardware Error]: TSC 357ccd1ee76 kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032517] mce: [Hardware Error]: PROCESSOR 0:506e3 TIME 1493297880 SOCKET 0 APIC 1 microcode 74 kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032521] CPU2: Package temperature/speed normal kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032521] CPU6: Package temperature/speed normal kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032522] CPU5: Package temperature/speed normal kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032523] CPU1: Package temperature/speed normal kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032523] CPU3: Package temperature/speed normal kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032524] CPU7: Package temperature/speed normal kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900515] CPU4: Core temperature above threshold, cpu clock throttled (total events = 3567) kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900515] CPU0: Core temperature above threshold, cpu clock throttled (total events = 3567) kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900518] CPU0: Package temperature above threshold, cpu clock throttled (total events = 4365) kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900520] CPU4: Package temperature above threshold, cpu clock throttled (total events = 4365) kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900524] mce_notify_irq: 1 callbacks suppressed kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900525] mce: [Hardware Error]: Machine check events logged kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900556] CPU5: Package temperature above threshold, cpu clock throttled (total events = 4365) kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900557] CPU1: Package temperature above threshold, cpu clock throttled (total events = 4365) kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900558] CPU2: Package temperature above threshold, cpu clock throttled (total events = 4365) kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900559] CPU6: Package temperature above threshold, cpu clock throttled (total events = 4365) kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900560] CPU3: Package temperature above threshold, cpu clock throttled (total events = 4365) kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900574] CPU7: Package temperature above threshold, cpu clock throttled (total events = 4365) kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900577] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 128: 0000000088030c03 kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900579] mce: [Hardware Error]: TSC 41dcfcb90f1 kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900583] mce: [Hardware Error]: PROCESSOR 0:506e3 TIME 1493298183 SOCKET 0 APIC 0 microcode 74 kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900585] mce: [Hardware Error]: CPU 4: Machine Check: 0 Bank 128: 0000000088030c03 kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900586] mce: [Hardware Error]: TSC 41dcfcbb2fa kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900589] mce: [Hardware Error]: PROCESSOR 0:506e3 TIME 1493298183 SOCKET 0 APIC 1 microcode 74 kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901528] CPU4: Core temperature/speed normal kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901528] CPU0: Core temperature/speed normal kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901529] CPU4: Package temperature/speed normal kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901530] CPU0: Package temperature/speed normal kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901530] mce: [Hardware Error]: Machine check events logged kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901537] mce: [Hardware Error]: CPU 4: Machine Check: 0 Bank 128: 0000000088040c02 kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901539] mce: [Hardware Error]: TSC 41dcff6eea8 kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901563] mce: [Hardware Error]: PROCESSOR 0:506e3 TIME 1493298183 SOCKET 0 APIC 1 microcode 74 kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901564] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 128: 0000000088040c02 kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901565] mce: [Hardware Error]: TSC 41dcff6f6f4 kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901566] mce: [Hardware Error]: PROCESSOR 0:506e3 TIME 1493298183 SOCKET 0 APIC 0 microcode 74 kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901568] CPU6: Package temperature/speed normal kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901568] CPU2: Package temperature/speed normal kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901569] CPU1: Package temperature/speed normal kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901570] CPU5: Package temperature/speed normal kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901570] CPU7: Package temperature/speed normal kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901571] CPU3: Package temperature/speed normal kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.946524] CPU2: Core temperature above threshold, cpu clock throttled (total events = 799) kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.946525] CPU6: Core temperature above threshold, cpu clock throttled (total events = 799) kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.946565] mce: [Hardware Error]: Machine check events logged kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.946602] mce: [Hardware Error]: CPU 6: Machine Check: 0 Bank 128: 0000000088030c03 kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.946604] mce: [Hardware Error]: TSC 45bf3a18c27 kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.946606] mce: [Hardware Error]: PROCESSOR 0:506e3 TIME 1493298278 SOCKET 0 APIC 5 microcode 74 kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.946608] mce: [Hardware Error]: CPU 2: Machine Check: 0 Bank 128: 0000000088030c03 kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.946609] mce: [Hardware Error]: TSC 45bf3a1d0c3 kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.946610] mce: [Hardware Error]: PROCESSOR 0:506e3 TIME 1493298278 SOCKET 0 APIC 4 microcode 74 kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.947504] CPU2: Core temperature/speed normal kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.947505] CPU6: Core temperature/speed normal kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.947506] mce: [Hardware Error]: Machine check events logged kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.947514] mce: [Hardware Error]: CPU 2: Machine Check: 0 Bank 128: 0000000088040c02 kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.947540] mce: [Hardware Error]: TSC 45bf3cb7070 kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.947542] mce: [Hardware Error]: PROCESSOR 0:506e3 TIME 1493298278 SOCKET 0 APIC 4 microcode 74 kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.947543] mce: [Hardware Error]: CPU 6: Machine Check: 0 Bank 128: 0000000088040c02 kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.947544] mce: [Hardware Error]: TSC 45bf3cb76a4 kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.947545] mce: [Hardware Error]: PROCESSOR 0:506e3 TIME 1493298278 SOCKET 0 APIC 5 microcode 74 kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.906514] CPU6: Package temperature above threshold, cpu clock throttled (total events = 6643) kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.906515] CPU2: Package temperature above threshold, cpu clock throttled (total events = 6643) kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.906554] CPU4: Package temperature above threshold, cpu clock throttled (total events = 6643) kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.906555] CPU7: Package temperature above threshold, cpu clock throttled (total events = 6643) kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.906556] CPU0: Package temperature above threshold, cpu clock throttled (total events = 6643) kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.906557] CPU5: Package temperature above threshold, cpu clock throttled (total events = 6643) kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.906572] CPU1: Package temperature above threshold, cpu clock throttled (total events = 6643) kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.906573] CPU3: Package temperature above threshold, cpu clock throttled (total events = 6643) kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.907486] CPU6: Package temperature/speed normal kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.907486] CPU2: Package temperature/speed normal kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.907525] CPU0: Package temperature/speed normal kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.907526] CPU5: Package temperature/speed normal kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.907526] CPU4: Package temperature/speed normal kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.907527] CPU1: Package temperature/speed normal kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.907528] CPU3: Package temperature/speed normal kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.907528] CPU7: Package temperature/speed normal kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.603494] CPU7: Core temperature above threshold, cpu clock throttled (total events = 102) kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.603494] CPU3: Core temperature above threshold, cpu clock throttled (total events = 102) kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.603501] mce: [Hardware Error]: Machine check events logged kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.603552] mce: [Hardware Error]: CPU 3: Machine Check: 0 Bank 128: 0000000088030803 kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.603555] mce: [Hardware Error]: TSC 527b59bd192 kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.603559] mce: [Hardware Error]: PROCESSOR 0:506e3 TIME 1493298589 SOCKET 0 APIC 6 microcode 74 kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.603574] mce: [Hardware Error]: CPU 7: Machine Check: 0 Bank 128: 0000000088030803 kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.603575] mce: [Hardware Error]: TSC 527b59be8c3 kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.603577] mce: [Hardware Error]: PROCESSOR 0:506e3 TIME 1493298589 SOCKET 0 APIC 7 microcode 74 kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.604533] CPU7: Core temperature/speed normal kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.604535] CPU3: Core temperature/speed normal kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.604536] mce: [Hardware Error]: Machine check events logged kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.604542] mce: [Hardware Error]: CPU 7: Machine Check: 0 Bank 128: 0000000088060802 kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.604543] mce: [Hardware Error]: TSC 527b5c85364 kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.604565] mce: [Hardware Error]: PROCESSOR 0:506e3 TIME 1493298589 SOCKET 0 APIC 7 microcode 74 kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.604566] mce: [Hardware Error]: CPU 3: Machine Check: 0 Bank 128: 0000000088060802 kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.604566] mce: [Hardware Error]: TSC 527b5c86d83 kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.604567] mce: [Hardware Error]: PROCESSOR 0:506e3 TIME 1493298589 SOCKET 0 APIC 6 microcode 74 *** The crash *** kern.log.1:Apr 27 16:27:35 arnox kernel: [ 6664.650456] nvme 0000:04:00.0: controller is down; will reset: CSTS=0xffffffff, PCI_STATUS=0xffff kern.log.1:Apr 27 16:27:35 arnox kernel: [ 6664.664491] nvme 0000:04:00.0: enabling device (0000 -> 0002) kern.log.1:Apr 27 16:27:35 arnox kernel: [ 6664.664542] nvme nvme0: Removing after probe failure status: -19 kern.log.1:Apr 27 16:27:35 arnox kernel: [ 6664.664549] nvme0n1: detected capacity change from 1024209543168 to 0 kern.log.1:Apr 27 16:27:35 arnox kernel: [ 6664.665083] blk_update_request: I/O error, dev nvme0n1, sector 0 kern.log.1:Apr 27 16:27:40 arnox kernel: [ 6670.584431] Aborting journal on device dm-1-8. kern.log.1:Apr 27 16:27:40 arnox kernel: [ 6670.584554] Buffer I/O error on dev dm-1, logical block 117473280, lost sync page write kern.log.1:Apr 27 16:27:40 arnox kernel: [ 6670.584625] JBD2: Error -5 detected when updating journal superblock for dm-1-8. kern.log.1:Apr 27 16:27:41 arnox kernel: [ 6670.588668] Buffer I/O error on dev dm-1, logical block 0, lost sync page write kern.log.1:Apr 27 16:27:41 arnox kernel: [ 6670.588718] EXT4-fs error (device dm-1): ext4_journal_check_start:56: Detected aborted journal kern.log.1:Apr 27 16:27:41 arnox kernel: [ 6670.588720] EXT4-fs (dm-1): Remounting filesystem read-only kern.log.1:Apr 27 16:27:41 arnox kernel: [ 6670.588739] EXT4-fs (dm-1): previous I/O error to superblock detected kern.log.1:Apr 27 16:27:41 arnox kernel: [ 6670.588792] Buffer I/O error on dev dm-1, logical block 0, lost sync page write kern.log.1:Apr 27 16:37:57 arnox kernel: [ 7287.097976] EXT4-fs warning (device dm-1): ext4_dx_find_entry:1532: inode #50594233: lblock 5: comm owncloud: error -5 reading directory block kern.log.1:Apr 27 16:37:57 arnox kernel: [ 7287.098084] EXT4-fs warning (device dm-1): ext4_dx_find_entry:1532: inode #50594233: lblock 5: comm owncloud: error -5 reading directory block kern.log.1:Apr 27 16:37:57 arnox kernel: [ 7287.098519] EXT4-fs warning (device dm-1): ext4_dx_find_entry:1532: inode #50594233: lblock 5: comm owncloud: error -5 reading directory block kern.log.1:Apr 27 16:37:57 arnox kernel: [ 7287.099092] EXT4-fs warning (device dm-1): ext4_dx_find_entry:1532: inode #50594233: lblock 5: comm owncloud: error -5 reading directory block kern.log.1:Apr 27 16:37:57 arnox kernel: [ 7287.099186] EXT4-fs warning (device dm-1): ext4_dx_find_entry:1532: inode #50594233: lblock 5: comm owncloud: error -5 reading directory block kern.log.1:Apr 27 16:37:57 arnox kernel: [ 7287.099378] EXT4-fs warning (device dm-1): ext4_dx_find_entry:1532: inode #50594233: lblock 5: comm owncloud: error -5 reading directory block -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux-signed in Ubuntu. https://bugs.launchpad.net/bugs/1682704 Title: nvme controller is down will reset (regression in zesty on XPS laptop) Status in linux-signed package in Ubuntu: Confirmed Bug description: I've just upgraded a Dell XPS 15" (9550, early 2016 model) with a Samsung NVME drive. Machine was stable under Kubuntu 16.10 with the same drive. After the upgrade to Zesty I've now seen 3 hard lockups (machine loses root fs) with the following message printed: nvme controller is down will reset there are also messages printed to the virtual console reporting failure to write to the underlying disk from the home-directory encfs. Linux tass 4.10.0-19-generic #21-Ubuntu SMP Thu Apr 6 17:04:57 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux Ubuntu 17.04 (Kubuntu) dmesg about nvme: [ 1.748864] nvme nvme0: pci function 0000:04:00.0 [ 1.864553] nvme0n1: p1 p2 p3 p4 p5 p6 [ 2.961181] EXT4-fs (nvme0n1p6): mounted filesystem with ordered data mode. Opts: (null) [ 4.172546] EXT4-fs (nvme0n1p6): re-mounted. Opts: errors=remount-ro NVME cli shows 57 errors in the error-log, all seeming to be invalid field or invalid namespace. Not sure if that's since boot or since machine creation. Smartctrl shows... smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.10.0-19-generic] (local build) Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org === START OF INFORMATION SECTION === Model Number: PM951 NVMe SAMSUNG 512GB Serial Number: S29PNXAH142328 Firmware Version: BXV77D0Q PCI Vendor/Subsystem ID: 0x144d IEEE OUI Identifier: 0x002538 Controller ID: 1 Number of Namespaces: 1 Namespace 1 Size/Capacity: 512,110,190,592 [512 GB] Namespace 1 Utilization: 365,503,283,200 [365 GB] Namespace 1 Formatted LBA Size: 512 Local Time is: Thu Apr 13 23:21:32 2017 EDT Firmware Updates (0x06): 3 Slots Optional Admin Commands (0x0017): Security Format Frmw_DL *Other* Optional NVM Commands (0x001f): Comp Wr_Unc DS_Mngmt Wr_Zero Sav/Sel_Feat Maximum Data Transfer Size: 32 Pages Supported Power States St Op Max Active Idle RL RT WL WT Ent_Lat Ex_Lat 0 + 6.00W - - 0 0 0 0 5 5 1 + 4.20W - - 1 1 1 1 30 30 2 + 3.10W - - 2 2 2 2 100 100 3 - 0.0700W - - 3 3 3 3 500 5000 4 - 0.0050W - - 4 4 4 4 2000 22000 Supported LBA Sizes (NSID 0x1) Id Fmt Data Metadt Rel_Perf 0 + 512 0 0 === START OF SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED SMART/Health Information (NVMe Log 0x02, NSID 0xffffffff) Critical Warning: 0x00 Temperature: 35 Celsius Available Spare: 100% Available Spare Threshold: 50% Percentage Used: 0% Data Units Read: 2,724,346 [1.39 TB] Data Units Written: 6,568,756 [3.36 TB] Host Read Commands: 52,921,997 Host Write Commands: 157,530,880 Controller Busy Time: 1,349 Power Cycles: 831 Power On Hours: 5,358 Unsafe Shutdowns: 46 Media and Data Integrity Errors: 0 Error Information Log Entries: 57 Error Information (NVMe Log 0x01, max 64 entries) Num ErrCount SQId CmdId Status PELoc LBA NSID VS 0 57 0 0x0004 0x4016 0x000 0 1 - 1 56 0 0x0004 0x4016 0x000 0 1 - 2 55 0 0x0004 0x4016 0x000 0 1 - 3 54 0 0x0004 0x4016 0x000 0 1 - 4 53 0 0x0004 0x4016 0x000 0 1 - 5 52 0 0x0004 0x4016 0x000 0 1 - 6 51 0 0x0004 0x4016 0x000 0 1 - 7 50 0 0x0004 0x4016 0x000 0 1 - 8 49 0 0x001f 0x4004 0x000 0 0 - 9 48 0 0x001e 0x4004 0x000 0 0 - 10 47 0 0x001f 0x4004 0x000 0 0 - 11 46 0 0x001e 0x4004 0x000 0 0 - 12 45 0 0x001f 0x4004 0x000 0 0 - 13 44 0 0x001e 0x4004 0x000 0 0 - 14 43 0 0x0000 0x4016 0x000 0 1 - 15 42 0 0x0004 0x4016 0x000 0 1 - ... (41 entries not shown) To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux-signed/+bug/1682704/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp