I just had a problem with softraid on a 4.6 box. I am not asking anyone to solve it: the box needed urgent replacement, so I replaced it. What I would like to ask for is advice on best practices for softraid under OpenBSD, to prevent similar things from happening again: hints on how to set it up better and, above all, how to recover it better.

What happened was that some slices in a softraid volume simply went away after a power surge. In detail: sd1 and sd2 were set up as RAID, and the resulting RAID1 volume (sd3) was sliced up into /usr, /var, /home, /var/www, /var/mail and swap. On the reboot after the power surge, two of the slices (/var/mail on sd3g and /home on sd3h) were simply unavailable. They could not be mounted by 'mount -a' at boot, and the system fell back to having only '/' mounted (on sd0).

Strangely, though, disklabel sd3 still showed the slices as sd3g and sd3h. But they could not be accessed at all, and were not visible under /dev/. This was unexpected behaviour as far as I am concerned, even more so since sysctl and bioctl reported the softraid as 'OK' and 'Online'.

I tried a few things, like running fsck_ffs on the two disappeared slices as well as on the 'good' ones. The good ones were fine, also with fsck_ffs -f. But the two missing ones were just not available (as devices). Then I made what I guess was a big mistake: instead of pulling out one of the drives, I ran bioctl -d on sd3, which left me with 2 drives with a RAID file system on them. Over.

Now, please, any suggestions on how to do better next time something like this happens?
Thanks in advance,

Uwe
