Re[2]: Re[2]: AHCI timeout when using ZFS + AIO + NCQ

2013-01-30 Thread Vladislav Prodan
> I once ran into a very severe AHCI timeout problem. After months of trying to figure it out and insane "Hardware_ECC_Recovered" error values, I found that the error was with the power connector plug / SATA HDD interface. All errors disappeared after replacing that cable. Since you have e

Re: Re[2]: AHCI timeout when using ZFS + AIO + NCQ

2013-01-27 Thread Beeblebrox
I once ran into a very severe AHCI timeout problem. After months of trying to figure it out and insane "Hardware_ECC_Recovered" error values, I found that the error was with the power connector plug / SATA HDD interface. All errors disappeared after replacing that cable. Since you have error on mor

Re[2]: AHCI timeout when using ZFS + AIO + NCQ

2013-01-27 Thread Vladislav Prodan
> >> Essentially the combination of SATA 3 speeds and the midplane / backplane degraded the connection between the MB and HDD enough to cause the disks to randomly drop when under load.
> >>
> >> If we connected the disks directly to the MB with SATA cables the problem went away. In

Re[2]: Re[2]: AHCI timeout when using ZFS + AIO + NCQ

2013-01-27 Thread Vladislav Prodan
> - Original Message -
> From: "Vladislav Prodan"
> >> Is it always the same disk? If so, replace it. SMART helps identify issues but doesn't tell you 100% there's no problem.
> >
> > Now a different HDD has dropped off: ada0.
> > I'm 99% sure that MHDD will not find prob

Re[2]: AHCI timeout when using ZFS + AIO + NCQ

2013-01-27 Thread Vladislav Prodan
> Is it always the same disk? If so, replace it. SMART helps identify issues but doesn't tell you 100% there's no problem.
Now a different HDD has dropped off: ada0. I'm 99% sure that MHDD will not find problems in the HDDs ada0 and ada2. I still have three servers with similar chipsets that ha

Re: Re[2]: AHCI timeout when using ZFS + AIO + NCQ

2013-01-27 Thread Steven Hartland
- Original Message -
From: "Vladislav Prodan"
Is it always the same disk? If so, replace it. SMART helps identify issues but doesn't tell you 100% there's no problem.
Now a different HDD has dropped off: ada0. I'm 99% sure that MHDD will not find problems in the HDDs ada0 and ada2.