[Beowulf] NFS & Scaling issues

Amrik Singh Fri, 06 Apr 2007 14:47:59 -0700

Hi,

We are running a cluster of 180 diskless compute nodes. 60 of them have32 bit AMD Semptron processors and rest are dual core AMD Athelon 64bit processors. 32 bit machines have 10/100 mbps and rest have gigabitethernet cards. We have four file servers, each hosting around 3.5TB onSATA drives connected to 3Ware RAID controller cards configured on RAID10 array. These file servers are exporting the drives through NFS. Eachfile server is running 265 daemons for nfsd.

The file servers are mainly hosting large number of small files rangingfrom 256KB to 2 MB. The compute nodes are primarily doing a searchthrough these files, so there is lot's of reading and some writing tothe file servers.

Recently we started noticing very high (70-90%) wait states on the fileservers when compute nodes. We have tried to optimize the NFS throughincreasing the number of daemons and the rsize and wsize but to no avail.

Can someone point us in the right direction as to how we should betrying to troubleshoot this problem.

PS: All the nodes are running SuSE 10.0 and servers are running SuSE10.0and 10.1 and all the drives are formatted with reiserfs.



thanks

--

Amrik


_______________________________________________
Beowulf mailing list, [email protected]
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf

[Beowulf] NFS & Scaling issues

Reply via email to