hi - i'm about to set up my 1st RAID, and i'd appreciate it if any of you would volunteer some time to share your valuable experience on this subject.
my scenario
-----------
0. i don't boot from the RAID.
1. read is as important as write. i don't have any application-specific
   scenario that makes me favor one over the other, so RAIDs that speed up
   reads (or writes) while significantly harming writes (or reads) are not
   welcome.
2. replacing failed disks may take a week or two, so i guess that several
   disks may fail one after another within those 1-2 weeks (especially if
   they were bought at about the same time).
3. i would like to be able to grow the RAID's total space as needed, and to
   increase its reliability (i.e. duplicates/parities) as needed. e.g.
   suppose that i've got a 2TB RAID that tolerates 1 disk failure. i'd like,
   at some point, to have the following options:
   * only increase the total space (e.g. make it 3TB), without increasing
     the failure tolerance (so a 2-disk failure would still result in data
     loss).
   * or, only increase the failure tolerance (e.g. such that a 2-disk
     failure would not lead to data loss), without increasing the total
     space (i.e. it remains 2TB).
   * or, increase both the space and the failure tolerance at the same time.
4. only interested in software RAID.

my thought
----------
i think these are not suitable:
* RAID 0: fails to satisfy point (3).
* RAID 1: fails to satisfy points (1) and (3).
* RAIDs 4 to 6: fail to satisfy point (3), since they are stuck with a
  fixed tolerance towards failing disks (i.e. RAIDs 4 and 5 tolerate only
  1 disk failure, and RAID 6 tolerates only 2).

this leaves me with md's RAID 10, with the "near" layout. e.g. --layout=n2
keeps 2 copies of every chunk (so it should tolerate at least 1 failed
disk), --layout=n3 keeps 3 copies (at least 2 failed disks), etc. or is it?
(i'm not sure. a concrete create command is in the p.s. below.)

my questions
------------
Q1: which RAID setup would you recommend?

Q2: how does the total number of disks in a RAID10 setup affect the
tolerance towards failing disks? if the total number of disks is even, then
it is easy to see how this is equivalent to the classical RAID 1+0 as shown
in md(4), where any disk failure is tolerated as long as each RAID1 group
has at most 1 failed disk. so, with 4 disks, we get the following
combinations of disk failures that we would survive without losing any
data:

             RAID0
        ------^------
        RAID1   RAID1
        --^--   --^--
        F .     . .     < cases with
        . F     . .     < single disk
        . .     F .     < failures
        . .     . F     <
        F .     . F     < cases with
        . F     F .     < two disk
        . F     . F     < failures
        F .     F .     <

this gives us 4+4=8 possible disk-failure scenarios that we can survive
without any data loss. but when the number of disks is odd, the written
chunks and their copies start to wrap around, and it is difficult for me to
see intuitively how this affects the total number of survivable scenarios.
(a brute-force counting sketch is in the p.p.s. below.)

Q3: what are the future growth/shrinkage options for a RAID10 setup? e.g.
with respect to these:
1. read/write speed.
2. tolerance guarantee towards failing disks.
3. total available space.
(a grow sketch is in the p.p.p.s. below.)

rgrds,
cm.
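
p.s. to make "my thought" concrete, this is the kind of create command i
have in mind - a minimal sketch only, where /dev/md0 and /dev/sd[b-e] are
placeholder names for the array and 4 example disks:

    # RAID10, "near" layout, 2 copies of every chunk, across 4 disks
    mdadm --create /dev/md0 --level=10 --layout=n2 --raid-devices=4 \
          /dev/sdb /dev/sdc /dev/sdd /dev/sde
    # watch the initial sync
    cat /proc/mdstat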
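
p.p.s. for Q2, here is a brute-force sketch that counts survivable failure
sets. it assumes my reading of md(4)'s near layout with 2 copies: the
copies of chunk c land on disks (2c) mod k and (2c+1) mod k. if that model
is wrong, so is the count:

    #!/bin/bash
    # count survivable failure sets for md RAID10, near layout, 2 copies,
    # assuming copies of chunk c sit on disks (2c) mod k and (2c+1) mod k
    k=5                        # total disks; k=4 reproduces the table above
    pairs=()
    for ((c=0; c<k; c++)); do  # every disk pair that mirrors some chunk
        pairs+=("$(( (2*c) % k )),$(( (2*c+1) % k ))")
    done
    survivable=0
    for ((mask=1; mask<(1<<k); mask++)); do  # every non-empty failure set
        ok=1
        for p in "${pairs[@]}"; do
            a=${p%,*} b=${p#*,}
            # data is lost iff both halves of some mirror pair have failed
            if (( (mask>>a & 1) && (mask>>b & 1) )); then ok=0; break; fi
        done
        (( survivable += ok ))
    done
    echo "disks=$k survivable failure sets=$survivable"

if i got the model right, k=4 prints 8 (matching the table above) and k=5
prints 10: the 5 single failures, plus the 5 two-disk failures where the
two dead disks never mirror each other.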
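
p.p.p.s. for Q3, the grow path i imagine looks like the sketch below. i'm
assuming a recent kernel and mdadm where RAID10 reshape is supported, an
ext4 filesystem on top, and /dev/sdf as a placeholder for the new disk -
md(4) and mdadm(8) for your versions are the authority here, not me:

    mdadm --add /dev/md0 /dev/sdf             # new disk joins as a spare
    mdadm --grow /dev/md0 --raid-devices=5    # reshape onto the extra disk
    cat /proc/mdstat                          # watch the reshape progress
    resize2fs /dev/md0                        # then grow the filesystem

increasing the failure tolerance instead (e.g. n2 -> n3) would be a layout
change, and i don't know whether --grow supports that in place - which is
partly why i'm asking Q3.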