[EXTERNAL EMAIL]
20 seems to be the Enclosure number

20h = 32d right?
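
A quick check of that conversion, as a minimal sketch. It assumes (as suggested elsewhere in this thread) that the "20" in "Enclosure PD 20" is printed in hexadecimal, while MegaCli prints "Enclosure Device ID" in decimal.

```python
# Assumption: the controller log prints the enclosure ID in hex,
# while MegaCli -PDList prints "Enclosure Device ID" in decimal.
enclosure_hex = "20"
enclosure_dec = int(enclosure_hex, 16)

print(f"0x{enclosure_hex} = {enclosure_dec}")  # 0x20 = 32
```

If that assumption holds, "PD 20" and "Enclosure Device ID: 32" name the same device.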

But it appears that the 600G drive (slot 0, Media Error Count: 5, Other Error Count: 1) is the problem.

Anyone disagree??

My results from a MegaCli request (./MegaCli -PDList -aALL):

Adapter #0

Enclosure Device ID: 32
Slot Number: 0
Drive's position: DiskGroup: 0, Span: 0, Arm: 0
Enclosure position: N/A
Device Id: 0
WWN:
Sequence Number: 10
Media Error Count: 5
Other Error Count: 1
Predictive Failure Count: 0
Last Predictive Failure Event Seq Number: 0
PD Type: SAS

Raw Size: 558.911 GB [0x45dd2fb0 Sectors]
Non Coerced Size: 558.411 GB [0x45cd2fb0 Sectors]
Coerced Size: 558.375 GB [0x45cc0000 Sectors]
Sector Size:  0
Firmware state: Online, Spun Up
Device Firmware Level: 0008
Shield Counter: 0
Successful diagnostics completion on :  N/A
SAS Address(0): 0x5000c5004bf0a8cd
SAS Address(1): 0x0
Connected Port Number: 0(path0)
Inquiry Data: SEAGATE ST3600057SS     00086SL3LVX9
FDE Capable: Not Capable
FDE Enable: Disable
Secured: Unsecured
Locked: Unlocked
Needs EKM Attention: No
Foreign State: None
Device Speed: Unknown
Link Speed: Unknown
Media Type: Hard Disk Device
Drive Temperature :31C (87.80 F)
PI Eligibility:  No
Drive is formatted for PI information:  No
PI: No PI
Port-0 :
Port status: Active
Port's Linkspeed: Unknown
Port-1 :
Port status: Active
Port's Linkspeed: Unknown
Drive has flagged a S.M.A.R.T alert : No



Enclosure Device ID: 32
Slot Number: 1
Drive's position: DiskGroup: 0, Span: 0, Arm: 1
Enclosure position: N/A
Device Id: 1
WWN:
Sequence Number: 2
Media Error Count: 0
Other Error Count: 0
Predictive Failure Count: 0
Last Predictive Failure Event Seq Number: 0
PD Type: SAS

Raw Size: 136.732 GB [0x11177328 Sectors]
Non Coerced Size: 136.232 GB [0x11077328 Sectors]
Coerced Size: 136.125 GB [0x11040000 Sectors]
Sector Size:  0
Firmware state: Online, Spun Up
Device Firmware Level: S527
Shield Counter: 0
Successful diagnostics completion on :  N/A
SAS Address(0): 0x5000c50008e558c5
SAS Address(1): 0x0
Connected Port Number: 1(path0)
Inquiry Data: SEAGATE ST3146855SS     S5273LN4V6A0
FDE Capable: Not Capable
FDE Enable: Disable
Secured: Unsecured
Locked: Unlocked
Needs EKM Attention: No
Foreign State: None
Device Speed: Unknown
Link Speed: Unknown
Media Type: Hard Disk Device
Drive Temperature :28C (82.40 F)
PI Eligibility:  No
Drive is formatted for PI information:  No
PI: No PI
Port-0 :
Port status: Active
Port's Linkspeed: Unknown
Port-1 :
Port status: Active
Port's Linkspeed: Unknown
Drive has flagged a S.M.A.R.T alert : No




Exit Code: 0x00
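
To compare the drives without reading the listing by eye, the error counters can be pulled out with a short script. This is only a sketch: the function names (parse_pdlist, suspect_drives) are made up for illustration, the sample text is a trimmed copy of the output above, and it assumes every drive block starts with an "Enclosure Device ID" line.

```python
# Trimmed copy of the -PDList output above (only the fields used here).
SAMPLE = """\
Enclosure Device ID: 32
Slot Number: 0
Media Error Count: 5
Other Error Count: 1
Predictive Failure Count: 0
Inquiry Data: SEAGATE ST3600057SS     00086SL3LVX9

Enclosure Device ID: 32
Slot Number: 1
Media Error Count: 0
Other Error Count: 0
Predictive Failure Count: 0
Inquiry Data: SEAGATE ST3146855SS     S5273LN4V6A0
"""

COUNTERS = ("Media Error Count", "Other Error Count", "Predictive Failure Count")


def parse_pdlist(text):
    """Split -PDList text into one dict of 'key: value' fields per drive."""
    drives = []
    current = None
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if not sep:
            continue  # blank lines and lines without a colon
        key, value = key.strip(), value.strip()
        if key == "Enclosure Device ID":
            # Assumption: this field opens each physical-drive block.
            current = {}
            drives.append(current)
        if current is not None:
            current[key] = value
    return drives


def suspect_drives(drives):
    """Return (slot, inquiry data, counters) for drives with any non-zero counter."""
    suspects = []
    for drive in drives:
        errors = {name: int(drive.get(name, "0")) for name in COUNTERS}
        if any(errors.values()):
            suspects.append((drive.get("Slot Number"), drive.get("Inquiry Data"), errors))
    return suspects


drives = parse_pdlist(SAMPLE)
suspects = suspect_drives(drives)
for slot, inquiry, errors in suspects:
    print(f"slot {slot}: {inquiry} -> {errors}")
```

On the sample, only slot 0 (the ST3600057SS) is reported. Non-zero counters are a hint, not proof that the drive is what triggers the enclosure errors.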


On 2019/10/02 12:41 PM, Grzegorz Bakalarski wrote:

I'd remove physical disk number 20 from the enclosure.

My guess: this disk has failing electronics and it resets the SAS/SCSI bus (SES errors).

But I am nobody and you should not follow me!

grzegorz


On 2019-10-02 18:19, [email protected] wrote:

Stephen, Thank you!
Poking around, I found a battery-not-charging notice, and
below is what I got as more information.
No idea what it means.    Suggestions, anyone??   ~Vince H.


10/02/19 16:13:52: EVT#19525876-10/02/19 16:13:52: 113=Unexpected sense: Encl PD 20, CDB: 12 00 00 00 04 00, Sense: 70 00 04 00 00 00 00 0a 00 00 00 00 35 05 00 00 00 00
10/02/19 16:14:12: EVT#19525877-10/02/19 16:14:12: 113=Unexpected sense: Encl PD 20, CDB: 12 00 00 00 04 00, Sense: 70 00 04 00 00 00 00 0a 00 00 00 00 35 05 00 00 00 00
10/02/19 16:14:13: EVT#19525878-10/02/19 16:14:13: 113=Unexpected sense: Encl PD 20, CDB: 12 00 00 00 04 00, Sense: 70 00 04 00 00 00 00 0a 00 00 00 00 35 05 00 00 00 00
10/02/19 16:14:14: EVT#19525879-10/02/19 16:14:14: 113=Unexpected sense: Encl PD 20, CDB: 12 00 00 00 04 00, Sense: 70 00 04 00 00 00 00 0a 00 00 00 00 35 05 00 00 00 00
10/02/19 16:14:15: EVT#19525880-10/02/19 16:14:15: 113=Unexpected sense: Encl PD 20, CDB: 12 00 00 00 04 00, Sense: 70 00 04 00 00 00 00 0a 00 00 00 00 35 05 00 00 00 00
10/02/19 16:14:16: EVT#19525881-10/02/19 16:14:16: 113=Unexpected sense: Encl PD 20, CDB: 12 00 00 00 04 00, Sense: 70 00 04 00 00 00 00 0a 00 00 00 00 35 05 00 00 00 00
10/02/19 16:14:17: EVT#19525882-10/02/19 16:14:17: 113=Unexpected sense: Encl PD 20, CDB: 12 00 00 00 04 00, Sense: 70 00 04 00 00 00 00 0a 00 00 00 00 35 05 00 00 00 00
10/02/19 16:14:17: EVT#19525883-10/02/19 16:14:17: 113=Unexpected sense: Encl PD 20, CDB: 1c 01 02 00 04 00, Sense: 70 00 04 00 00 00 00 0a 00 00 00 00 35 05 00 00 00 00
10/02/19 16:14:17: EVT#19525884-10/02/19 16:14:17: 187=Enclosure PD 20(c None/p0) hardware error
10/02/19 16:14:17: sesGenericCallback: Not Responding 2 encl =1
10/02/19 16:14:22: SES enclosure 1 Recovered after fault
10/02/19 16:14:22: EVT#19525885-10/02/19 16:14:22: 167=Enclosure PD 20(c None/p0) communication restored
10/02/19 16:14:22: EVT#19525886-10/02/19 16:14:22: 113=Unexpected sense: Encl PD 20, CDB: 12 00 00 00 04 00, Sense: 70 00 04 00 00 00 00 0a 00 00 00 00 35 05 00 00 00 00
10/02/19 16:14:23: EVT#19525887-10/02/19 16:14:23: 113=Unexpected sense: Encl PD 20, CDB: 12 00 00 00 04 00, Sense: 70 00 04 00 00 00 00 0a 00 00 00 00 35 05 00 00 00 00

Exit Code: 0x00
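
The "Unexpected sense" entries can be decoded by hand. The sketch below assumes the bytes after "Sense:" are SCSI fixed-format sense data (response code 70h), where the low nibble of byte 2 is the sense key and bytes 12 and 13 are the additional sense code and qualifier (ASC/ASCQ). The lookup tables are deliberately partial, decode_entry is a made-up name, and the input is one termlog entry per line.

```python
import re

# Partial tables, enough for the entries in this log.
SENSE_KEYS = {
    0x0: "NO SENSE",
    0x1: "RECOVERED ERROR",
    0x2: "NOT READY",
    0x3: "MEDIUM ERROR",
    0x4: "HARDWARE ERROR",
    0x5: "ILLEGAL REQUEST",
    0x6: "UNIT ATTENTION",
}
OPCODES = {0x12: "INQUIRY", 0x1C: "RECEIVE DIAGNOSTIC RESULTS"}

# One entry copied from the termlog above (timestamp prefix dropped).
LINE = ("113=Unexpected sense: Encl PD 20, CDB: 1c 01 02 00 04 00, "
        "Sense: 70 00 04 00 00 00 00 0a 00 00 00 00 35 05 00 00 00 00")

HEX_RUN = r"[0-9a-f]{2}(?: [0-9a-f]{2})*"
PATTERN = re.compile(rf"CDB: ({HEX_RUN}), Sense: ({HEX_RUN})")


def decode_entry(line):
    """Decode the CDB opcode and fixed-format sense data from one log entry."""
    match = PATTERN.search(line)
    if match is None:
        return None
    cdb = bytes.fromhex(match.group(1))
    sense = bytes.fromhex(match.group(2))
    if sense[0] & 0x7F not in (0x70, 0x71) or len(sense) < 14:
        return None  # not fixed-format sense data, or too short to decode
    key = sense[2] & 0x0F
    return {
        "opcode": OPCODES.get(cdb[0], f"opcode 0x{cdb[0]:02x}"),
        "sense_key": SENSE_KEYS.get(key, f"sense key 0x{key:x}"),
        "asc": sense[12],
        "ascq": sense[13],
    }


result = decode_entry(LINE)
print(result["opcode"], "->", result["sense_key"],
      f"ASC/ASCQ {result['asc']:02x}/{result['ascq']:02x}")
```

For this entry the sense key is 4 (HARDWARE ERROR) with ASC/ASCQ 35/05. As far as I recall, the 35h codes in the SCSI ASC/ASCQ table are the enclosure services ones, which would point at the enclosure/SES path rather than at a data drive, but check the exact meaning against the T10 list.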


AND THIS:

Time: Wed Oct  2 16:06:37 2019

Code: 0x000000bb
Class: 2
Locale: 0x04
Event Description: Enclosure PD 20(c None/p0) hardware error
Event Data:
===========
Device ID: 32
Enclosure Index: 0
Slot Number: 255


seqNum: 0x0129f0ce
Time: Wed Oct  2 16:06:42 2019

Code: 0x000000a7
Class: 0
Locale: 0x04
Event Description: Enclosure PD 20(c None/p0) communication restored
Event Data:
===========
Device ID: 32
Enclosure Index: 0
Slot Number: 255


seqNum: 0x0129f0cf
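
One way to tie these event details to the termlog: the "Code" and "seqNum" fields are printed in hex here, while the termlog prints decimal numbers. A small conversion, using only values copied from the two events above:

```python
# Values copied from the event details above.
event_codes = ["0x000000bb", "0x000000a7"]
seq_nums = ["0x0129f0ce", "0x0129f0cf"]

decimal_codes = [int(code, 16) for code in event_codes]
decimal_seqs = [int(seq, 16) for seq in seq_nums]

print(decimal_codes)  # [187, 167]
print(decimal_seqs)   # [19525838, 19525839]
```

187 and 167 match the "187=... hardware error" and "167=... communication restored" prefixes in the termlog, and the sequence numbers sit just below the EVT# values logged a few minutes later, so both outputs appear to describe the same kind of event.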


Tried to look up the errors...  not making sense....







On 2019/10/01 13:42 PM, Stephen Dowdy wrote:


On 10/1/19 10:41 AM, [email protected] <mailto:[email protected]> wrote:
Sep 26 15:25:47 Debian9 kernel: megaraid_sas 0000:01:00.0: 19474006
(622841147s/0x0004/CRIT) - Enclosure PD 20(c None/p0) hardware error

"Enclosure PD 20" is in hex: 0x20 is decimal 32.  This is the typical enclosure address for
Dell server internal enclosures.

So, this appears to be reporting a hardware error on the enclosure,
rather than with any of the drives.


You probably want to get megacli or perccli and run:

     megacli fwtermlog dsply a0

     perccli /c0 show termlog

to get more details

--stephen

_______________________________________________
Linux-PowerEdge mailing list
[email protected] <mailto:[email protected]>
https://lists.us.dell.com/mailman/listinfo/linux-poweredge
