Bug description updated w/ SRU template. Patch submitted to kernel-team mailing list.
[SRU Z/X][PATCH] scsi: ipr: do not set DID_PASSTHROUGH on CHECK CONDITION https://lists.ubuntu.com/archives/kernel-team/2017-August/086518.html Requested for - Zesty/17.04 (target 16.04.3 HWE kernel / v4.10-based) and - Xenial/16.04 (GA kernel / v4.4-based). Already applied on - Artful/17.10 (16.04.4 HWE kernel). ** Description changed: + SRU Justification: + + Impact: stuck I/O to multipath disks with medium errors (on IPR controllers) + Fix: upstream commit for IPR driver to allow SCSI layer to handle the error + Testcase: perform I/O to a failing disk which is multipathed (on IPR + controller), which returns SCSI Medium Errors (without the fix, + the I/O gets stuck). + the commit message describes a test-case w/ sg_dd. + + ---Problem Description--- IPR driver causes multipath to fail paths/stuck IO on Medium Errors This problem is resolved with this upstream accepted patch, scheduled for 4.11. The detailed problem description and resolution are described in the commit message. > scsi: ipr: do not set DID_PASSTHROUGH on CHECK CONDITION - > http://git.kernel.org/?p=linux/kernel/git/jejb/scsi.git;a=commit;h=785a470496d8e0a32e3d39f376984eb2c98ca5b3 + > https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=785a470496d8e0a32e3d39f376984eb2c98ca5b3 - Please apply to 17.04 and 16.04. + Please apply to 17.04 (target 16.04.3 HWE kernel) and 16.04 (GA kernel). + Patch already applied to 17.10. The business justification for the SRU is: Clients with a dual-controller multipathed IPR configuration that eventually runs into failing disk/sectors, will experience an I/O hang once the drive reports a Medium Error, which can hang an application or even the root filesystem (whatever is doing I/O to the failing drive), potentially hanging the system. Thanks. - + ---Additional Hardware Info--- - Dual (IPR) controller setup, multipath enabled - + Dual (IPR) controller setup, multipath enabled + ---Steps to Reproduce--- 1) Use a disk with bad sectors (or force such condition, via internal/special tools) 2) Multipath that disk 3) Run IO to the multipath device on the bad sectors 4) Both paths will be failed, and IO is stuck due to queue_if_no_path (enabled by default for IPR) The detailed problem description and resolution are described in the commit message. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1682644 Title: IPR driver causes multipath to fail paths/stuck IO on Medium Errors To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1682644/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs