> -----Original Message----- > From: Guilherme G. Piccoli [mailto:[email protected]] > Sent: Friday, December 22, 2017 8:32 PM > To: Elior, Ariel <[email protected]>; Dept-Eng Everest Linux L2 <Dept- > [email protected]> > Cc: [email protected]; [email protected]; > [email protected] > Subject: [PATCH net] bnx2x: Improve reliability in case of nested PCI errors > > While in recovery process of PCI error (called EEH on PowerPC arch), another > PCI transaction could be corrupted causing a situation of nested PCI errors. > Also, > this scenario could be reproduced with error injection mechanisms (for debug > purposes). > > We observe that in case of nested PCI errors, bnx2x might attempt to > initialize > its shmem and cause a kernel crash due to bad addresses read from MCP. > Multiple different stack traces were observed depending on the point the > second > PCI error happens. > > This patch avoids the crashes by: > > * failing PCI recovery in case of nested errors (since multiple PCI errors > in a row > are not expected to lead to a functional adapter anyway), and by, > > * preventing access to adapter FW when MCP is failed (we mark it as failed > when shmem cannot get initialized properly). > > Reported-by: Abdul Haleem <[email protected]> > Signed-off-by: Guilherme G. Piccoli <[email protected]>
Acked-by: Shahed Shaikh <[email protected]> Thanks, Shahed
