*** This bug is a duplicate of bug 1678184 ***
    https://bugs.launchpad.net/bugs/1678184

I'm having the same problem with my Dell Precision 5510. As soon as the crash 
occurs the file system is mounted in read-only mode and a few seconds or 
minutes later the entire machine crashes. The problem occurs with the following 
kernels that I tested:
- linux-image-4.10.0-20-generic
- linux-image-4.10.0-19-generic
- linux-image-4.11.0-041100rc7-generic_4.11.0-041100rc7.201704161731


Dell Inc. Precision 5510/08R8KJ, BIOS 01.01.19 01/25/2016

I configured my laptop to send the kernel messages via syslog to another
machine. Below are the messages from the crash.

*** Before the crash ***
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.110539] xhci_hcd 0000:0a:00.0: 
remove, state 1
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.110543] usb usb4: USB 
disconnect, device number 1
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.110544] usb 4-1: USB 
disconnect, device number 2
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.202761] xhci_hcd 0000:0a:00.0: 
Host halt failed, -19
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.202781] xhci_hcd 0000:0a:00.0: 
Host not accessible, reset failed.
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.202782] xhci_hcd 0000:0a:00.0: 
USB bus 4 deregistered
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.202787] xhci_hcd 0000:0a:00.0: 
remove, state 4
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.202805] usb usb3: USB 
disconnect, device number 1
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.203066] xhci_hcd 0000:0a:00.0: 
USB bus 3 deregistered
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.246781] pci_bus 0000:3e: 
busn_res: can not insert [bus 3e] under [bus 07-0a] (conflicts with (null) [bus 
07-0a])
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.246812] pci 0000:3e:00.0: 
[8086:15b5] type 00 class 0x0c0330
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.246831] pci 0000:3e:00.0: reg 
0x10: [mem 0xd9f00000-0xd9f0ffff]
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.247002] pci 0000:3e:00.0: 
supports D1 D2
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.247003] pci 0000:3e:00.0: PME# 
supported from D0 D1 D2 D3hot D3cold
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.247161] pcieport 0000:07:02.0: 
PCI bridge to [bus 3e]
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.247169] pcieport 0000:07:02.0:  
 bridge window [mem 0xd9f00000-0xd9ffffff]
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.247175] pci_bus 0000:3e: [bus 
3e] partially hidden behind bridge 0000:07 [bus 07-0a]
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.247197] pci_bus 0000:07: 
Allocating resources
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.247577] xhci_hcd 0000:3e:00.0: 
xHCI Host Controller
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.247582] xhci_hcd 0000:3e:00.0: 
new USB bus registered, assigned bus number 3
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.248805] xhci_hcd 0000:3e:00.0: 
hcc params 0x200077c1 hci version 0x110 quirks 0x00009810
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249199] usb usb3: New USB 
device found, idVendor=1d6b, idProduct=0002
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249200] usb usb3: New USB 
device strings: Mfr=3, Product=2, SerialNumber=1
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249201] usb usb3: Product: xHCI 
Host Controller
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249202] usb usb3: Manufacturer: 
Linux 4.10.0-20-generic xhci-hcd
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249202] usb usb3: SerialNumber: 
0000:3e:00.0
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249373] hub 3-0:1.0: USB hub 
found
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249435] hub 3-0:1.0: 2 ports 
detected
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249603] xhci_hcd 0000:3e:00.0: 
xHCI Host Controller
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249605] xhci_hcd 0000:3e:00.0: 
new USB bus registered, assigned bus number 4
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249799] usb usb4: New USB 
device found, idVendor=1d6b, idProduct=0003
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249800] usb usb4: New USB 
device strings: Mfr=3, Product=2, SerialNumber=1
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249801] usb usb4: Product: xHCI 
Host Controller
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249802] usb usb4: Manufacturer: 
Linux 4.10.0-20-generic xhci-hcd
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.249802] usb usb4: SerialNumber: 
0000:3e:00.0
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.250032] hub 4-0:1.0: USB hub 
found
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.250053] hub 4-0:1.0: 2 ports 
detected
kern.log.1:Apr 27 14:55:08 arnox kernel: [ 1118.594431] usb 4-1: new SuperSpeed 
USB device number 2 using xhci_hcd
kern.log.1:Apr 27 14:55:09 arnox kernel: [ 1118.614826] usb 4-1: New USB device 
found, idVendor=0bda, idProduct=8153
kern.log.1:Apr 27 14:55:09 arnox kernel: [ 1118.614840] usb 4-1: New USB device 
strings: Mfr=1, Product=2, SerialNumber=6
kern.log.1:Apr 27 14:55:09 arnox kernel: [ 1118.614841] usb 4-1: Product: USB 
10/100/1000 LAN
kern.log.1:Apr 27 14:55:09 arnox kernel: [ 1118.614841] usb 4-1: Manufacturer: 
Realtek
kern.log.1:Apr 27 14:55:09 arnox kernel: [ 1118.614842] usb 4-1: SerialNumber: 
000001000000
kern.log.1:Apr 27 14:55:09 arnox kernel: [ 1118.734678] usb 4-1: reset 
SuperSpeed USB device number 2 using xhci_hcd
kern.log.1:Apr 27 14:55:09 arnox kernel: [ 1118.756975] r8152 4-1:1.0 (unnamed 
net_device) (uninitialized): Using pass-thru MAC addr f8:ca:b8:6a:33:d5
kern.log.1:Apr 27 14:55:09 arnox kernel: [ 1118.811163] r8152 4-1:1.0 eth0: 
v1.08.8
kern.log.1:Apr 27 14:55:09 arnox kernel: [ 1118.825557] r8152 4-1:1.0 
enxf8cab86a33d5: renamed from eth0
kern.log.1:Apr 27 14:55:09 arnox kernel: [ 1118.859589] IPv6: 
ADDRCONF(NETDEV_UP): enxf8cab86a33d5: link is not ready
kern.log.1:Apr 27 14:55:09 arnox kernel: [ 1118.888334] IPv6: 
ADDRCONF(NETDEV_UP): enxf8cab86a33d5: link is not ready
kern.log.1:Apr 27 14:55:12 arnox kernel: [ 1122.202756] r8152 4-1:1.0 
enxf8cab86a33d5: carrier on
kern.log.1:Apr 27 14:55:12 arnox kernel: [ 1122.202763] IPv6: 
ADDRCONF(NETDEV_CHANGE): enxf8cab86a33d5: link becomes ready
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031508] CPU0: Core temperature 
above threshold, cpu clock throttled (total events = 1)
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031509] CPU4: Core temperature 
above threshold, cpu clock throttled (total events = 1)
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031510] CPU7: Package 
temperature above threshold, cpu clock throttled (total events = 1)
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031510] CPU3: Package 
temperature above threshold, cpu clock throttled (total events = 1)
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031513] CPU1: Package 
temperature above threshold, cpu clock throttled (total events = 1)
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031513] CPU5: Package 
temperature above threshold, cpu clock throttled (total events = 1)
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031514] CPU4: Package 
temperature above threshold, cpu clock throttled (total events = 1)
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031538] CPU0: Package 
temperature above threshold, cpu clock throttled (total events = 1)
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031540] mce: [Hardware Error]: 
Machine check events logged
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031541] CPU2: Package 
temperature above threshold, cpu clock throttled (total events = 1)
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031542] mce: [Hardware Error]: 
Machine check events logged
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031542] CPU6: Package 
temperature above threshold, cpu clock throttled (total events = 1)
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031568] mce: [Hardware Error]: 
CPU 4: Machine Check: 0 Bank 128: 0000000088030803
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031569] mce: [Hardware Error]: 
TSC 357cca83e4d 
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031584] mce: [Hardware Error]: 
PROCESSOR 0:506e3 TIME 1493297880 SOCKET 0 APIC 1 microcode 74
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031587] mce: [Hardware Error]: 
CPU 0: Machine Check: 0 Bank 128: 0000000088030803
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031589] mce: [Hardware Error]: 
TSC 357cca97135 
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.031611] mce: [Hardware Error]: 
PROCESSOR 0:506e3 TIME 1493297880 SOCKET 0 APIC 0 microcode 74
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032481] CPU0: Core 
temperature/speed normal
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032482] CPU4: Core 
temperature/speed normal
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032482] CPU0: Package 
temperature/speed normal
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032483] CPU4: Package 
temperature/speed normal
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032489] mce: [Hardware Error]: 
CPU 0: Machine Check: 0 Bank 128: 0000000088050802
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032491] mce: [Hardware Error]: 
TSC 357ccd1e67f 
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032492] mce: [Hardware Error]: 
PROCESSOR 0:506e3 TIME 1493297880 SOCKET 0 APIC 0 microcode 74
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032493] mce: [Hardware Error]: 
CPU 4: Machine Check: 0 Bank 128: 0000000088050802
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032516] mce: [Hardware Error]: 
TSC 357ccd1ee76 
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032517] mce: [Hardware Error]: 
PROCESSOR 0:506e3 TIME 1493297880 SOCKET 0 APIC 1 microcode 74
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032521] CPU2: Package 
temperature/speed normal
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032521] CPU6: Package 
temperature/speed normal
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032522] CPU5: Package 
temperature/speed normal
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032523] CPU1: Package 
temperature/speed normal
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032523] CPU3: Package 
temperature/speed normal
kern.log.1:Apr 27 14:58:00 arnox kernel: [ 1290.032524] CPU7: Package 
temperature/speed normal
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900515] CPU4: Core temperature 
above threshold, cpu clock throttled (total events = 3567)
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900515] CPU0: Core temperature 
above threshold, cpu clock throttled (total events = 3567)
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900518] CPU0: Package 
temperature above threshold, cpu clock throttled (total events = 4365)
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900520] CPU4: Package 
temperature above threshold, cpu clock throttled (total events = 4365)
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900524] mce_notify_irq: 1 
callbacks suppressed
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900525] mce: [Hardware Error]: 
Machine check events logged
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900556] CPU5: Package 
temperature above threshold, cpu clock throttled (total events = 4365)
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900557] CPU1: Package 
temperature above threshold, cpu clock throttled (total events = 4365)
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900558] CPU2: Package 
temperature above threshold, cpu clock throttled (total events = 4365)
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900559] CPU6: Package 
temperature above threshold, cpu clock throttled (total events = 4365)
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900560] CPU3: Package 
temperature above threshold, cpu clock throttled (total events = 4365)
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900574] CPU7: Package 
temperature above threshold, cpu clock throttled (total events = 4365)
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900577] mce: [Hardware Error]: 
CPU 0: Machine Check: 0 Bank 128: 0000000088030c03
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900579] mce: [Hardware Error]: 
TSC 41dcfcb90f1 
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900583] mce: [Hardware Error]: 
PROCESSOR 0:506e3 TIME 1493298183 SOCKET 0 APIC 0 microcode 74
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900585] mce: [Hardware Error]: 
CPU 4: Machine Check: 0 Bank 128: 0000000088030c03
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900586] mce: [Hardware Error]: 
TSC 41dcfcbb2fa 
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.900589] mce: [Hardware Error]: 
PROCESSOR 0:506e3 TIME 1493298183 SOCKET 0 APIC 1 microcode 74
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901528] CPU4: Core 
temperature/speed normal
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901528] CPU0: Core 
temperature/speed normal
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901529] CPU4: Package 
temperature/speed normal
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901530] CPU0: Package 
temperature/speed normal
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901530] mce: [Hardware Error]: 
Machine check events logged
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901537] mce: [Hardware Error]: 
CPU 4: Machine Check: 0 Bank 128: 0000000088040c02
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901539] mce: [Hardware Error]: 
TSC 41dcff6eea8 
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901563] mce: [Hardware Error]: 
PROCESSOR 0:506e3 TIME 1493298183 SOCKET 0 APIC 1 microcode 74
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901564] mce: [Hardware Error]: 
CPU 0: Machine Check: 0 Bank 128: 0000000088040c02
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901565] mce: [Hardware Error]: 
TSC 41dcff6f6f4 
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901566] mce: [Hardware Error]: 
PROCESSOR 0:506e3 TIME 1493298183 SOCKET 0 APIC 0 microcode 74
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901568] CPU6: Package 
temperature/speed normal
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901568] CPU2: Package 
temperature/speed normal
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901569] CPU1: Package 
temperature/speed normal
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901570] CPU5: Package 
temperature/speed normal
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901570] CPU7: Package 
temperature/speed normal
kern.log.1:Apr 27 15:03:03 arnox kernel: [ 1592.901571] CPU3: Package 
temperature/speed normal
kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.946524] CPU2: Core temperature 
above threshold, cpu clock throttled (total events = 799)
kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.946525] CPU6: Core temperature 
above threshold, cpu clock throttled (total events = 799)
kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.946565] mce: [Hardware Error]: 
Machine check events logged
kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.946602] mce: [Hardware Error]: 
CPU 6: Machine Check: 0 Bank 128: 0000000088030c03
kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.946604] mce: [Hardware Error]: 
TSC 45bf3a18c27 
kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.946606] mce: [Hardware Error]: 
PROCESSOR 0:506e3 TIME 1493298278 SOCKET 0 APIC 5 microcode 74
kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.946608] mce: [Hardware Error]: 
CPU 2: Machine Check: 0 Bank 128: 0000000088030c03
kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.946609] mce: [Hardware Error]: 
TSC 45bf3a1d0c3 
kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.946610] mce: [Hardware Error]: 
PROCESSOR 0:506e3 TIME 1493298278 SOCKET 0 APIC 4 microcode 74
kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.947504] CPU2: Core 
temperature/speed normal
kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.947505] CPU6: Core 
temperature/speed normal
kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.947506] mce: [Hardware Error]: 
Machine check events logged
kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.947514] mce: [Hardware Error]: 
CPU 2: Machine Check: 0 Bank 128: 0000000088040c02
kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.947540] mce: [Hardware Error]: 
TSC 45bf3cb7070 
kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.947542] mce: [Hardware Error]: 
PROCESSOR 0:506e3 TIME 1493298278 SOCKET 0 APIC 4 microcode 74
kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.947543] mce: [Hardware Error]: 
CPU 6: Machine Check: 0 Bank 128: 0000000088040c02
kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.947544] mce: [Hardware Error]: 
TSC 45bf3cb76a4 
kern.log.1:Apr 27 15:04:38 arnox kernel: [ 1687.947545] mce: [Hardware Error]: 
PROCESSOR 0:506e3 TIME 1493298278 SOCKET 0 APIC 5 microcode 74
kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.906514] CPU6: Package 
temperature above threshold, cpu clock throttled (total events = 6643)
kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.906515] CPU2: Package 
temperature above threshold, cpu clock throttled (total events = 6643)
kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.906554] CPU4: Package 
temperature above threshold, cpu clock throttled (total events = 6643)
kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.906555] CPU7: Package 
temperature above threshold, cpu clock throttled (total events = 6643)
kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.906556] CPU0: Package 
temperature above threshold, cpu clock throttled (total events = 6643)
kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.906557] CPU5: Package 
temperature above threshold, cpu clock throttled (total events = 6643)
kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.906572] CPU1: Package 
temperature above threshold, cpu clock throttled (total events = 6643)
kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.906573] CPU3: Package 
temperature above threshold, cpu clock throttled (total events = 6643)
kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.907486] CPU6: Package 
temperature/speed normal
kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.907486] CPU2: Package 
temperature/speed normal
kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.907525] CPU0: Package 
temperature/speed normal
kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.907526] CPU5: Package 
temperature/speed normal
kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.907526] CPU4: Package 
temperature/speed normal
kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.907527] CPU1: Package 
temperature/speed normal
kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.907528] CPU3: Package 
temperature/speed normal
kern.log.1:Apr 27 15:08:03 arnox kernel: [ 1892.907528] CPU7: Package 
temperature/speed normal
kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.603494] CPU7: Core temperature 
above threshold, cpu clock throttled (total events = 102)
kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.603494] CPU3: Core temperature 
above threshold, cpu clock throttled (total events = 102)
kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.603501] mce: [Hardware Error]: 
Machine check events logged
kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.603552] mce: [Hardware Error]: 
CPU 3: Machine Check: 0 Bank 128: 0000000088030803
kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.603555] mce: [Hardware Error]: 
TSC 527b59bd192 
kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.603559] mce: [Hardware Error]: 
PROCESSOR 0:506e3 TIME 1493298589 SOCKET 0 APIC 6 microcode 74
kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.603574] mce: [Hardware Error]: 
CPU 7: Machine Check: 0 Bank 128: 0000000088030803
kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.603575] mce: [Hardware Error]: 
TSC 527b59be8c3 
kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.603577] mce: [Hardware Error]: 
PROCESSOR 0:506e3 TIME 1493298589 SOCKET 0 APIC 7 microcode 74
kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.604533] CPU7: Core 
temperature/speed normal
kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.604535] CPU3: Core 
temperature/speed normal
kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.604536] mce: [Hardware Error]: 
Machine check events logged
kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.604542] mce: [Hardware Error]: 
CPU 7: Machine Check: 0 Bank 128: 0000000088060802
kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.604543] mce: [Hardware Error]: 
TSC 527b5c85364 
kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.604565] mce: [Hardware Error]: 
PROCESSOR 0:506e3 TIME 1493298589 SOCKET 0 APIC 7 microcode 74
kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.604566] mce: [Hardware Error]: 
CPU 3: Machine Check: 0 Bank 128: 0000000088060802
kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.604566] mce: [Hardware Error]: 
TSC 527b5c86d83 
kern.log.1:Apr 27 15:09:49 arnox kernel: [ 1999.604567] mce: [Hardware Error]: 
PROCESSOR 0:506e3 TIME 1493298589 SOCKET 0 APIC 6 microcode 74

*** The crash ***
kern.log.1:Apr 27 16:27:35 arnox kernel: [ 6664.650456] nvme 0000:04:00.0: 
controller is down; will reset: CSTS=0xffffffff, PCI_STATUS=0xffff
kern.log.1:Apr 27 16:27:35 arnox kernel: [ 6664.664491] nvme 0000:04:00.0: 
enabling device (0000 -> 0002)
kern.log.1:Apr 27 16:27:35 arnox kernel: [ 6664.664542] nvme nvme0: Removing 
after probe failure status: -19
kern.log.1:Apr 27 16:27:35 arnox kernel: [ 6664.664549] nvme0n1: detected 
capacity change from 1024209543168 to 0
kern.log.1:Apr 27 16:27:35 arnox kernel: [ 6664.665083] blk_update_request: I/O 
error, dev nvme0n1, sector 0
kern.log.1:Apr 27 16:27:40 arnox kernel: [ 6670.584431] Aborting journal on 
device dm-1-8.
kern.log.1:Apr 27 16:27:40 arnox kernel: [ 6670.584554] Buffer I/O error on dev 
dm-1, logical block 117473280, lost sync page write
kern.log.1:Apr 27 16:27:40 arnox kernel: [ 6670.584625] JBD2: Error -5 detected 
when updating journal superblock for dm-1-8.
kern.log.1:Apr 27 16:27:41 arnox kernel: [ 6670.588668] Buffer I/O error on dev 
dm-1, logical block 0, lost sync page write
kern.log.1:Apr 27 16:27:41 arnox kernel: [ 6670.588718] EXT4-fs error (device 
dm-1): ext4_journal_check_start:56: Detected aborted journal
kern.log.1:Apr 27 16:27:41 arnox kernel: [ 6670.588720] EXT4-fs (dm-1): 
Remounting filesystem read-only
kern.log.1:Apr 27 16:27:41 arnox kernel: [ 6670.588739] EXT4-fs (dm-1): 
previous I/O error to superblock detected
kern.log.1:Apr 27 16:27:41 arnox kernel: [ 6670.588792] Buffer I/O error on dev 
dm-1, logical block 0, lost sync page write
kern.log.1:Apr 27 16:37:57 arnox kernel: [ 7287.097976] EXT4-fs warning (device 
dm-1): ext4_dx_find_entry:1532: inode #50594233: lblock 5: comm owncloud: error 
-5 reading directory block
kern.log.1:Apr 27 16:37:57 arnox kernel: [ 7287.098084] EXT4-fs warning (device 
dm-1): ext4_dx_find_entry:1532: inode #50594233: lblock 5: comm owncloud: error 
-5 reading directory block
kern.log.1:Apr 27 16:37:57 arnox kernel: [ 7287.098519] EXT4-fs warning (device 
dm-1): ext4_dx_find_entry:1532: inode #50594233: lblock 5: comm owncloud: error 
-5 reading directory block
kern.log.1:Apr 27 16:37:57 arnox kernel: [ 7287.099092] EXT4-fs warning (device 
dm-1): ext4_dx_find_entry:1532: inode #50594233: lblock 5: comm owncloud: error 
-5 reading directory block
kern.log.1:Apr 27 16:37:57 arnox kernel: [ 7287.099186] EXT4-fs warning (device 
dm-1): ext4_dx_find_entry:1532: inode #50594233: lblock 5: comm owncloud: error 
-5 reading directory block
kern.log.1:Apr 27 16:37:57 arnox kernel: [ 7287.099378] EXT4-fs warning (device 
dm-1): ext4_dx_find_entry:1532: inode #50594233: lblock 5: comm owncloud: error 
-5 reading directory block

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux-signed in Ubuntu.
https://bugs.launchpad.net/bugs/1682704

Title:
  nvme controller is down will reset (regression in zesty on XPS laptop)

Status in linux-signed package in Ubuntu:
  Confirmed

Bug description:
  I've just upgraded a Dell XPS 15" (9550, early 2016 model) with a
  Samsung NVME drive. Machine was stable under Kubuntu 16.10 with the
  same drive. After the upgrade to Zesty I've now seen 3 hard lockups
  (machine loses root fs) with the following message printed:

      nvme controller is down will reset

  there are also messages printed to the virtual console reporting
  failure to write to the underlying disk from the home-directory encfs.

  Linux tass 4.10.0-19-generic #21-Ubuntu SMP Thu Apr 6 17:04:57 UTC
  2017 x86_64 x86_64 x86_64 GNU/Linux

  Ubuntu 17.04 (Kubuntu)

  dmesg about nvme:
  [    1.748864] nvme nvme0: pci function 0000:04:00.0
  [    1.864553]  nvme0n1: p1 p2 p3 p4 p5 p6
  [    2.961181] EXT4-fs (nvme0n1p6): mounted filesystem with ordered data 
mode. Opts: (null)
  [    4.172546] EXT4-fs (nvme0n1p6): re-mounted. Opts: errors=remount-ro

  NVME cli shows 57 errors in the error-log, all seeming to be invalid
  field or invalid namespace. Not sure if that's since boot or since
  machine creation.

  Smartctrl shows...
  smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.10.0-19-generic] (local build)
  Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

  === START OF INFORMATION SECTION ===
  Model Number:                       PM951 NVMe SAMSUNG 512GB
  Serial Number:                      S29PNXAH142328
  Firmware Version:                   BXV77D0Q
  PCI Vendor/Subsystem ID:            0x144d
  IEEE OUI Identifier:                0x002538
  Controller ID:                      1
  Number of Namespaces:               1
  Namespace 1 Size/Capacity:          512,110,190,592 [512 GB]
  Namespace 1 Utilization:            365,503,283,200 [365 GB]
  Namespace 1 Formatted LBA Size:     512
  Local Time is:                      Thu Apr 13 23:21:32 2017 EDT
  Firmware Updates (0x06):            3 Slots
  Optional Admin Commands (0x0017):   Security Format Frmw_DL *Other*
  Optional NVM Commands (0x001f):     Comp Wr_Unc DS_Mngmt Wr_Zero Sav/Sel_Feat
  Maximum Data Transfer Size:         32 Pages

  Supported Power States
  St Op     Max   Active     Idle   RL RT WL WT  Ent_Lat  Ex_Lat
   0 +     6.00W       -        -    0  0  0  0        5       5
   1 +     4.20W       -        -    1  1  1  1       30      30
   2 +     3.10W       -        -    2  2  2  2      100     100
   3 -   0.0700W       -        -    3  3  3  3      500    5000
   4 -   0.0050W       -        -    4  4  4  4     2000   22000

  Supported LBA Sizes (NSID 0x1)
  Id Fmt  Data  Metadt  Rel_Perf
   0 +     512       0         0

  === START OF SMART DATA SECTION ===
  SMART overall-health self-assessment test result: PASSED

  SMART/Health Information (NVMe Log 0x02, NSID 0xffffffff)
  Critical Warning:                   0x00
  Temperature:                        35 Celsius
  Available Spare:                    100%
  Available Spare Threshold:          50%
  Percentage Used:                    0%
  Data Units Read:                    2,724,346 [1.39 TB]
  Data Units Written:                 6,568,756 [3.36 TB]
  Host Read Commands:                 52,921,997
  Host Write Commands:                157,530,880
  Controller Busy Time:               1,349
  Power Cycles:                       831
  Power On Hours:                     5,358
  Unsafe Shutdowns:                   46
  Media and Data Integrity Errors:    0
  Error Information Log Entries:      57

  Error Information (NVMe Log 0x01, max 64 entries)
  Num   ErrCount  SQId   CmdId  Status  PELoc          LBA  NSID    VS
    0         57     0  0x0004  0x4016  0x000            0     1     -
    1         56     0  0x0004  0x4016  0x000            0     1     -
    2         55     0  0x0004  0x4016  0x000            0     1     -
    3         54     0  0x0004  0x4016  0x000            0     1     -
    4         53     0  0x0004  0x4016  0x000            0     1     -
    5         52     0  0x0004  0x4016  0x000            0     1     -
    6         51     0  0x0004  0x4016  0x000            0     1     -
    7         50     0  0x0004  0x4016  0x000            0     1     -
    8         49     0  0x001f  0x4004  0x000            0     0     -
    9         48     0  0x001e  0x4004  0x000            0     0     -
   10         47     0  0x001f  0x4004  0x000            0     0     -
   11         46     0  0x001e  0x4004  0x000            0     0     -
   12         45     0  0x001f  0x4004  0x000            0     0     -
   13         44     0  0x001e  0x4004  0x000            0     0     -
   14         43     0  0x0000  0x4016  0x000            0     1     -
   15         42     0  0x0004  0x4016  0x000            0     1     -
  ... (41 entries not shown)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux-signed/+bug/1682704/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to