On Fri, 14 Sep 2007, Bruce Allen wrote:

 I will try to get fsprobe deployed on as much of the Nordic LHC storage as
 possible.

I'll get fsprobe up and running on the new systems I am putting together in Hannover, and will also try to encourage the right people to get it running on some of the LIGO Scientific Collaboration's other storage systems.

I might be dense after the holiday, but I still don't get the reasons for such interest in running fsprobe. I can see it being used as a burn-in test and to prove that a running system can write and then read data correctly, but what does it say about data that is already written, or about data that is in flight while fsprobe runs? (Someone else asked this question earlier in the thread and didn't get an answer either.) And how is fsprobe better as a burn-in test than, say, badblocks?
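For readers who haven't looked at it, the kind of write-then-read probe being discussed can be sketched in a few lines. This is a hypothetical minimal version, not the actual fsprobe code; `probe_once` and its parameters are my own names. Note that the read-back may be served from the page cache rather than the platter, which is exactly the sort of limitation the questions above are about.

```python
# Minimal sketch of a write-then-read integrity probe (hypothetical,
# NOT the actual fsprobe implementation): write a deterministic
# pattern, fsync it, read it back, and compare checksums.
import hashlib
import os
import tempfile

def probe_once(directory, size=1 << 20, seed=b"probe"):
    """Write `size` bytes of deterministic data, then verify them."""
    data = hashlib.sha256(seed).digest() * (size // 32)
    fd, path = tempfile.mkstemp(dir=directory)
    try:
        with os.fdopen(fd, "wb") as f:
            f.write(data)
            f.flush()
            os.fsync(f.fileno())  # push the data past the page cache to the device
        with open(path, "rb") as f:
            readback = f.read()
        # Mismatch here means the storage stack corrupted the data in flight.
        return readback == data
    finally:
        os.remove(path)

if __name__ == "__main__":
    print("OK" if probe_once(".") else "CORRUPTION DETECTED")
```

Even with the fsync, such a probe only proves that *new* writes survive the round trip; it says nothing about data already at rest, which is the point of the question.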

I am genuinely interested in these answers because I wrote a somewhat similar tool 5-6 years ago to test new storage, simply because I didn't trust the vendors' burn-in tests enough. My interest was a bit broader: apart from data correctness, I also checked the behaviour of FS quota accounting (by creating randomly sized files with random ownership) and of the disk+FS in the face of fragmentation (by measuring "instantaneous" speed). But I never saw the potential usage by other people, mainly because I could not find answers to the above questions, so I never thought about making it public... and now it's too late ;-)
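The fragmentation side of that idea is easy to illustrate: time each file write individually, and watch whether the per-file speed drifts downward as the filesystem fills and fragments. The sketch below is my own hypothetical reconstruction, not the original tool; the quota-accounting part (changing ownership of the created files) would need `os.chown()` and root privileges, so it is omitted here.

```python
# Hypothetical sketch in the spirit of the tool described above:
# write a randomly sized file and report the "instantaneous" write
# speed, whose decline over many runs can hint at fragmentation.
import os
import random
import time

def write_random_file(path, max_size=4 << 20, chunk=64 << 10):
    """Write a random multiple of `chunk` bytes; return (size, bytes/sec)."""
    size = random.randrange(chunk, max_size, chunk)
    payload = os.urandom(chunk)
    start = time.monotonic()
    with open(path, "wb") as f:
        for _ in range(size // chunk):
            f.write(payload)
        f.flush()
        os.fsync(f.fileno())  # include the flush-to-disk in the timing
    elapsed = time.monotonic() - start
    return size, size / elapsed

if __name__ == "__main__":
    size, speed = write_random_file("testfile.bin")
    print(f"wrote {size} bytes at {speed / 1e6:.1f} MB/s")
    os.remove("testfile.bin")
```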

There is another issue that I could never find a good answer to: how much testing should a storage device withstand before the testing itself becomes dangerous or disturbing? Access by the test tool consumes resources: connections are shared, caches are polluted, heads have to be moved. For example, for the 1.something GB/s figure that was mentioned earlier in this thread, would you accept a halving of the speed while the data integrity test is being run? Or more generally, how much of the overall performance of the storage system would you be willing to give up for the benefit of knowing that data can still be written and then read correctly?

And sadly, some data is missing from the results that Google and others published recently: how much were the disks seeking (moving heads) during operation? I imagine such data is hard to get (it should probably come from the disk rather than the kernel, since the firmware could still reorder requests), but IMHO it is valuable for those designing multi-user storage systems, where disks frequently move heads to access files that belong to different users (and are therefore spread across the disk) and are used "simultaneously".
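One common way to keep the performance cost bounded, rather than open-ended, is to throttle the probe to a fixed bandwidth budget so it never takes more than an agreed fraction of the system's throughput. A minimal sketch of such a throttle (my own illustration, not part of fsprobe; `Throttle` and `account` are hypothetical names):

```python
# Hypothetical sketch: cap a probe's I/O at a fixed byte rate so the
# integrity test consumes a known, bounded slice of the bandwidth.
import time

class Throttle:
    """Sleep as needed so that accounted I/O stays under `rate` bytes/s."""
    def __init__(self, rate):
        self.rate = rate
        self.start = time.monotonic()
        self.done = 0

    def account(self, nbytes):
        """Call after each read/write of `nbytes`; sleeps if we are ahead."""
        self.done += nbytes
        ahead = self.done / self.rate - (time.monotonic() - self.start)
        if ahead > 0:
            time.sleep(ahead)
```

A probe loop would call `throttle.account(len(chunk))` after each I/O; with `rate` set to, say, 5% of the measured peak, the "how much would you give up" question becomes an explicit, tunable policy rather than a side effect.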

--
Bogdan Costescu

IWR - Interdisziplinaeres Zentrum fuer Wissenschaftliches Rechnen
Universitaet Heidelberg, INF 368, D-69120 Heidelberg, GERMANY
Telephone: +49 6221 54 8869, Telefax: +49 6221 54 8868
E-mail: [EMAIL PROTECTED]
_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf
