Our developers  had  that issue  of inconsistent  file system  view in RHEL
based systems, some of it is solved by disabling dir list caching, another
by using noac,

well, developers should be smart enough to know what FS they're using,
and how it's intended to behave. turning off AC is a nice option, but smarter is to leave it on and not try to cause race conditions. (I expect that such race-friendly behavior will fail on some other non-NFS filesystems, though probably harder to trigger.)

what the other was doing was writing simultaneously to  the
same file partitioned over several nodes, I told this is probably not the
right way to do file writing. apparently he used to do it in Sun Solaris and
it worked flawlessly.

I would spank any developer who said "but it works on platform X"!
developers must be aware of the spec, not merely what they can get away with somewhere, sometime. of course, this is the thinking behind apps having "supported" platforms - just a fancy way of saying "no, we don't know what standards-conformance we need, or how we violate the standard, but here's a few places we haven't yet noticed any bad-enough bugs".

writing to different sections of a file is probably wrong on any networked FS, since there will inherently be obscure interactions with the size and alignment of the writes vs client pagecache, network transport, actual network FS, server pagecache and underlying server/disk FS. in my experience, people who expect it to "just work"
have an incredibly naive model of how a network FS works (ie, write()
produces an RPC direct to the server)
_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf

Reply via email to