Public bug reported:

[Impact]
When error handling progresses to host reset, several issues may prevent the 
system from recovering, therefore requiring a system power cycle.

[Test Case]
$ iozone -a &
$ while :; do sudo sg_reset --device /dev/sda; sleep 5; done

[Fix]
There is a race between device removal and host reset which can cause the 
driver to hang, preventing the user from accessing attached devices until a 
reboot. Fix this by adding locking around the critical path.

After a soft host reset, commands maybe sent to the device before the
hardware is ready to receive them. This can result in additional errors
when the user access the device. Fix this by blocking commands until the
hardware has been reinitialized.

Stale PHY events may still get processed by the driver after reset. This
can cause e.g. ports to be detached because an old pre-reset "phy down"
event gets processed, causing the user to lose access to attached
devices. Fix this by filtering out pre-reset PHY events.

Resource starvation can occur after a "clear nexus ha" reset. Fix this
by releasing those resources during the reset.

[Regression Risk]
The required fixes are localized to the hisi_sas driver. This driver is only 
used by two platforms supported by Ubuntu: HiSilicon D05 and HiSilicon D06. We 
will directly verify these fixes on those platforms.

** Affects: linux (Ubuntu)
     Importance: Undecided
     Assignee: dann frazier (dannf)
         Status: In Progress

** Affects: linux (Ubuntu Bionic)
     Importance: Undecided
     Assignee: dann frazier (dannf)
         Status: In Progress

** Also affects: linux (Ubuntu Bionic)
   Importance: Undecided
       Status: New

** Changed in: linux (Ubuntu)
       Status: New => In Progress

** Changed in: linux (Ubuntu Bionic)
       Status: New => In Progress

** Changed in: linux (Ubuntu)
     Assignee: (unassigned) => dann frazier (dannf)

** Changed in: linux (Ubuntu Bionic)
     Assignee: (unassigned) => dann frazier (dannf)

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1777696

Title:
  hisi_sas: Failures during host reset

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1777696/+subscriptions

-- 
ubuntu-bugs mailing list
[email protected]
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to