From: "Vladimir 'φ-coder/phcoder' Serbinenko" <phco...@gmail.com>
On 18.11.2011 16:47, Diego Guella wrote:
Actually, I discovered that the bug is still there for me too, although in has another shape now.

I now have a RAID-1 with 4 members, I use 5 HDD and rotate them daily.
During the Debian installation, I created a 2-member RAID-1, and later I grew the array to 4 members.

If I understand this correctly your RAID never has all the devices connected. This leads to big desync (even writing once to an incomplete RAID causes desync). This is not a proper way to handle array. Frankly, I'm surprised anything works at all under such abuse.

I've used my 5 device RAID1 for over 2 years with lenny, and never got those problems.
Maybe I'm abusing RAID, but I don't think so.

I'd like to understand better what you are talking about "desync".

This is what I do (and have done since 2 years with lenny):
4-member RAID1: (a),(b),(c),(d), plus one HDD (e) disconnected from the system.
1.the system is on, 4 HDDs present, RAID1 ok.
2.turn the system off
3.remove HDD (d) from the system (it was /dev/sdd)
4.attach HDD (e) to the system (it will become the new /dev/sdd)
5.power on the system

At this point, the RAID1 is in this state:
-----
root@devilserver:~# cat /proc/mdstat
Personalities : [raid1]
md1 : active raid1 sdb3[2] sdc3[3] sda3[6]
     1903708024 blocks super 1.2 [4/3] [U_UU]

md0 : active raid1 sdb2[2] sdc2[3] sda2[6]
     48827320 blocks super 1.2 [4/3] [U_UU]

unused devices: <none>
-----
(md1 is mounted on /home, md0 is mounter on /)

6.add HDD (e) to the RAID1:
-----
root@devilserver:~# mdadm /dev/md0 -a /dev/sdd2
mdadm: re-added /dev/sdd2
root@devilserver:~# mdadm /dev/md1 -a /dev/sdd3
mdadm: re-added /dev/sdd3
-----

At this point, the RAID1 is in this state:
-----
root@devilserver:~# cat /proc/mdstat
Personalities : [raid1]
md1 : active raid1 sdd3[5] sdb3[2] sdc3[3] sda3[6]
     1903708024 blocks super 1.2 [4/3] [U_UU]
       resync=DELAYED

md0 : active raid1 sdd2[5] sdb2[2] sdc2[3] sda2[6]
     48827320 blocks super 1.2 [4/3] [U_UU]
[>....................] recovery = 1.1% (539648/48827320) finish=28.3min speed=28402K/sec

unused devices: <none>
-----

When the resync will complete, the RAID1 will be OK again.
The disconnected HDD (d) is an emergency copy of the system.
I can recover files from it, or even connect it to an identical system and get a working machine in 0 minutes in case of a disaster.

Now I'd like to understand:
-What's wrong with what I'm doing?
Pretend that drive (d) really dies when the system was turned off.
What I'm supposed to do in that situation? Pick a new drive (e), connect it to the system, and add it to the RAID1 array.
Isn't that the same?

I can grow the array to 5-members, the only downside of that is the annoying mail message from mdadm because of "DegradedArray event" at every boot of the machine.
I am open to other suggestions, too.

Diego



--
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org

Reply via email to