You are correct that you could have data loss, but he could still "survive" a two-drive loss IF the they were the "proper" two drives (OA+MB or OB+MA in your example). I should have stated that I didn't recommend running in this degraded mode, but the system could survive a two drive loss with RAID10 to get the system down and the problem corrected.
We've used RAID10 (mirror then stripe) both hardware and software on our production database servers for sometime now. I can tell you that you can loose multiple drives and still run in degraded mode. Obviously you wouldn't want to run in this situation in production, but it does give you time to get the hardware you need and/or shutdown the system so that no data loss occurs. For example, we have 2 Dell PowerVault 220S disk arrays with 14 drives in each array as the data area for our production Linux database server. We had a Dell PERC3/QC four-channel RAID controller connected to the PowerVaults. We had a combination of hardware RAID1 and software RAID0 because of a limitation of the hardware RAID card. Check out http://w3.one.net/~djflux/graphics/raiddiag.png for a visual (ignore the software RAID1 (/dev/md1) as we don't use that anymore, but the top portion is what we have now with all software RAID). The 220's can have a split backplane so that two SCSI controller channels (or 2 channels of a RAID controller) can control one 14-disk PV. One morning I came in and one whole side of one PowerVault was offline (7 drives) and the machine was still cranking away with no problems (other than a quarter of it's array was gone :) ). I shutdown the server, checked all the drives, replaced the SCSI cable and away I went. No data loss. I admit I was lucky but it always pays to have good backups as well ;) As I said, I don't recommend running a production system degraded like that, but from experience I know that it will stay up IF the "correct" drives go offline. My apologies for not being precise :) Talk to you later, Andy. -----Original Message----- From: Ward William E DLDN [mailto:[EMAIL PROTECTED] Sent: Thursday, March 20, 2003 3:22 PM To: '[EMAIL PROTECTED]' Subject: RE: RAID modes and priority > -----Original Message----- > From: Rechenberg, Andrew > [mailto:[EMAIL PROTECTED] > Sent: Thursday, March 20, 2003 9:12 AM > To: [EMAIL PROTECTED] > Cc: [EMAIL PROTECTED] > Subject: RE: RAID modes and priority > > > RAID10 can theoretically survive a two drive loss in your siutation > (they have to be the correct two drives :) ), whereas RAID0+1 will die > if you have a two drive loss. Andrew, that can't be right in the situation that Peter was describing. Raid 0+1 is striping and mirroring, vice RAID 10 being mirroring and striping. In a 4 drive setup, that means he has a stripe across two drives, and then is mirroring those two drives to the other two. Like this: OA-OB STRIPE CHAIN | | MIRROR MA-MB STRIPE CHAIN With O being the "original" and M being the "mirror" sets. Obviously, you can lose any ONE drive without loss of data in this setup, since you have a totally intact stripe on either the Original side or the Mirror side. Depending on how you implement the RAID unit (and whether it's hot swappable, for example) you may not even have to go down, although that's highly dependent on your particular software RAID controller. Now, if you lose TWO drives at one time, you have two distinct possibilities. You can lose either an entire stripe (either the original or the mirror) or you can lose one drive off of each. Again, if you lose an entire stripe, obviously lose nothing, and again, depending on your setup, you may not even go down, though you will obviously fail down to a RAID 0. It gets more dangerous, however, when you lose a drive off each stripe... Let's say you lose OA and MB (or MA and OB, doesn't matter). In that case, you're going to have some minor problems, depending on your RAID controller. In this case, you should be able to safely shut down, and you wouldn't lose any data, but you DEFINITELY don't want to run in this configuration. Most Software RAID units won't start in this configuration, but some should be able to RUN in this configuration long enough to shutdown. You could be back up and operable (albeit without mirroring, so defaulting back down to RAID 0), within minutes, though, and it may be as simple as a change in the RAID lashing in a the RAID BIOS on some software RAID, such as the ones on some motherboards, without a need to go in and physically switch the cables around. The WORST case is if you lose both "pairs" of drives, the original and the mirror (OA and MA or OB and MB). You have lost data, and CANNOT recover in this case; better pull out your backups. Even worse, you won't be able to recover much (depending on the size of the stripes) even from the existing drives; if a file isn't COMPLETELY on one drive (the surviving drive) then it's unrecoverable. However, it's pretty obvious that even in 2 drive failure mode, that should only happen in 1 our of three cases. And 2 drive failure mode should be pretty uncommon to begin with, barring a SPECIFIC event, in which case you have other problems, anyway; lightening strikes, power surges, etc., should be engineered out at the beginning, via UPS and line conditioning. RAID 0+1 does not guarantee under all possible two drive failure modes that you don't lose data, but it's still fairly robust even in that mode. No RAID unit can support 1/2 of the drives being removed under all regimens, either (except RAID 1, in a 2 drive setup, but I don't think we're really talking about just mirroring, are we?). RAID10 and RAID 0+1 are (in a way) opposites, but in both cases, they should behave somewhat similarly under this particular failure mode. Of course, you could ignore everything I've said; all of the RAID units I've ever built were RAID 5 (although that's changing this weekend, as I'm going to be setting up RAID0 on both my wife's and my computers (CompUSA had 7200RPM Maxtor 120GB ATA-133 HDD for $79.95 after mail in rebate... sale ends today, for folks who might suddenly be saying "Hmmmm" ;) )so my experience with RAID 0, RAID 1, RAID10, and RAID 0+1 is all theoretical. I've built plenty of RAID 5 units, though. -- redhat-list mailing list unsubscribe mailto:[EMAIL PROTECTED] https://listman.redhat.com/mailman/listinfo/redhat-list -- redhat-list mailing list unsubscribe mailto:[EMAIL PROTECTED] https://listman.redhat.com/mailman/listinfo/redhat-list