Re: recommendations for supported, affordable hardware raid controller.

David Christensen Sat, 02 Jan 2021 13:35:34 -0800

On 2021-01-02 03:24, Andrei POPESCU wrote:

http://www.unixsheikh.com/articles/battle-testing-data-integrity-verification-with-zfs-btrfs-and-mdadm-dm-integrity.html


That looks interesting.  Thanks for the link.  :-)


On 2021-01-02 08:08, Richard Hector wrote:

On 3/01/21 12:24 am, Andrei POPESCU wrote:

In case of data corruption (system crash, power outage, user error,
or even just a HDD "hiccup") plain md without the dm-integrity
layer won't even be able to tell which is the good data and will
overwrite your good data with bad data. Silently.

I've had crashes and power outages and never noticed any problems,
not that that means they won't happen (or even that they haven't
happened). Does a journalling filesystem on top not cover that?

AIUI a journaling filesystem provides a two-step process to achieveatomic writes of multiple sectors to disk -- e.g. a process wants to putsome data into a block here (say, a file), a block there (say, adirectory), etc., and consistency of the on-disk data structures must bepreserved. The journal provides a two-step process whereby everythingis written to the journal, then everything is written to disk. Ifeither step is interrupted, the filesystem driver will detect thefailure and respond. When done, either all of the blocks have beenupdated on disk or none of the blocks on disk have been changed.

Integrity checking addresses different failure modes by applyingchecksums to data blocks and metadata blocks. If the contents of ablock become corrupt, either in memory, in transit, on disk, etc., thedriver will detect the failure and respond. If redundant data isavailable, such as via RAID, the driver will correct the data andoperations continue. If no redundant data is available, the driver willgenerate an error. File system layering features in the Linux kernelallow you to add the dm-integrity device mapper layer into a storagestack as desired:


https://www.kernel.org/doc/html/latest/admin-guide/device-mapper/dm-integrity.html

On a related note, it is wise to have ECC memory to protect against datacorruption in memory:


http://www.openoid.net/will-zfs-and-non-ecc-ram-kill-your-data/

More failure modes exist (potentially, an infinite number). It's aquestion of what failure modes and effects concern you, and how muchtime and money you want to spend to mitigate risks.



David

Re: recommendations for supported, affordable hardware raid controller.

Reply via email to