I am sorry for not making things clearer. scp was faster than NFS only when the NFS server was being hammered by the client nodes, which happens only once the number of client nodes rises above a certain point. In my test I was reading files (1-2 MB) from an NFS server from 180 nodes. All the nodes were reading different files from different folders, so there was no caching, and there was a huge I/O wait time on the server CPUs.
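
For reference, this is the kind of thing I was watching on the server while the clients were reading (the intervals are arbitrary, and iostat assumes the sysstat package is installed):

server:~# vmstat 2 5     # the "wa" column is CPU time spent waiting on I/O
server:~# iostat -x 2 3  # per-disk utilisation and average wait times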

The setup is like this:

Server exports /data
Clients mount /data and read files from /data/node1, /data/node2 and so on.
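
Roughly, the server's /etc/exports has a line like (the options here are only illustrative, not our exact settings):

/data   *(rw,sync,no_subtree_check)

and each client mounts it with:

node1:~# mount -t nfs server:/data /data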

While the server is busy, if I try to do

node1:/tmp# scp largefile1 server:/data/node1
node1:/tmp# cp largefile2 /data/node1

Now in this situation scp is faster than cp. This led me to believe that NFS is the bottleneck when it is hammered by requests from too many nodes.
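
(For concreteness, the comparison is simply a matter of timing the two copies, e.g.

node1:/tmp# time scp largefile1 server:/data/node1
node1:/tmp# time cp largefile2 /data/node1

with two files of comparable size.)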


Amrik


Bogdan Costescu wrote:
On Wed, 25 Apr 2007, Amrik Singh wrote:

I agree that Jumbo Frames would not be a great help with the root file system, but we hope to get better performance from the other NFS servers.

I don't quite follow you here: the link that was sent earlier showed a kernel-level autoconfig problem. If this is indeed still the case with newer kernels, it should only affect the root FS: before mounting the other NFS exports, you should have the chance to perform a proper initialization of the network, which would let you use a larger MTU.
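
For example, once the network is up and before the other NFS mounts, something along these lines would do (9000 is just the usual jumbo value; check what your switches actually support):

node1:~# ip link set dev eth0 mtu 9000
(or, with the older tools: ifconfig eth0 mtu 9000)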

As all the machines on the same subnet have to be using the jumbo frames,

Why? The MTU specifies a _maximum_ value; even when using 1500, not all packets are exactly 1500 bytes. The larger MTU _allows_ larger values, but doesn't _force_ them.
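
You can also check easily whether large frames make it end to end, e.g. with

node1:~# ping -M do -s 8972 server

(8972 is 9000 minus 28 bytes of IP+ICMP headers; -M do forbids fragmentation, so the ping fails if some hop only accepts 1500).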

Even though NFS is extremely slow, copying files over scp is still very fast between a client and server.

If by this you mean that NFS is slower than scp, then it should be exactly the other way around, because scp needs CPU time for its encryption. I would have guessed that you have some network problems and use NFS over UDP, which leads to high retransmission rates; scp would adapt better to the network problems by virtue of using TCP... but you mention below that you have already tried switching between TCP and UDP.
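
The retransmission rate is easy to check on a client, e.g. with

node1:~# nfsstat -rc

which prints the RPC "calls" and "retrans" counters; a retrans count that is a sizeable fraction of the calls would point at the network rather than at the server.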

We have tried all different ways to tune NFS for better performance (increasing the number of NFS daemons on the servers, changing rsize & wsize, using TCP vs UDP, using async vs sync, noatime, timeo).

Could it be that you tried so hard to tune it that you ended up with too many settings that don't play well together in your particular situation? I've found the NFS client and server in recent kernels to perform reasonably well in their default configuration; although they could be further optimized, an NFS transfer in these conditions would always beat an scp one.
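
For reference, the client-side options mentioned above end up on a mount line like

node1:~# mount -t nfs -o tcp,rsize=32768,wsize=32768,noatime,timeo=14 server:/data /data

(the values are only examples, not a recommendation), and the number of nfsd threads can be changed at run time on the server with

server:~# rpc.nfsd 16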

initrd=bzImage ramdisk=40960

dhclient could be located in the ramdisk. Actually, even a whole root FS could be located in the ramdisk, provided that not too many nodes boot at the same time and cause UDP packet loss.

... and you already have a rather large ramdisk. Have you created it yourself (and therefore know what's in it and how to add some more)?
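
(If it is a gzipped cpio initramfs, one way to look inside it and add e.g. dhclient is roughly:

mkdir /tmp/ird && cd /tmp/ird
zcat /path/to/initrd.img | cpio -id
cp /sbin/dhclient sbin/
find . | cpio -o -H newc | gzip > /path/to/initrd.new

An older ext2-image initrd would need a loopback mount instead; the paths above are only placeholders.)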


