Re: [Beowulf] Transient NFS Problems in New Cluster

2010-02-02 Thread Henning Fehrmann
On Tue, Feb 02, 2010 at 02:00:37PM -0800, Jon Forrest wrote: > I have a new cluster running CentOS 5.3. > The cluster uses a Sun 7310 storage server > that provides NFS service over a private > 1Gb/s ethernet with 9K jumbo frames to the > cluster. > > We've noticed that a number of the compute > n

Re: [Beowulf] hardware question - which PSU for this?

2010-02-02 Thread Andrew Piskorski
On Tue, Feb 02, 2010 at 10:41:44AM +0100, Vincent Diepeveen wrote: > This seems ideal mainboard for beowulf clusters. built in infiniband > it seems. > > http://cgi.ebay.com/Arima-AMD-Opteron-Quad-Core-Socket-F-3000-series-Server_W0QQitemZ390149471460QQcmdZViewItemQQptZCOMP_EN_Networking_Compon

Re: [Beowulf] problem of mpich-1.2.7p1

2010-02-02 Thread Gus Correa
Hi Christian What is the content of your file /mirror/mpich-1.2.7p1/share/machines.LINUX? Please send it on your next message, it may clarify. It looks like to me that your program is working correctly. (I am guessing a bit, because you didn't send the source code.) When you did "mpirun -np 1

Re: [Beowulf] problem of mpich-1.2.7p1

2010-02-02 Thread Gus Correa
PS - And don't run the programs as root! Gus Correa Gus Correa wrote: Hi Christian Somehow your program was not attached to the message. In any case, you didn't say anything about your "machinefile" contents. You need to list the nodes you want to use there. The command line will be something

Re: [Beowulf] problem of mpich-1.2.7p1

2010-02-02 Thread Gus Correa
Hi Christian Somehow your program was not attached to the message. In any case, you didn't say anything about your "machinefile" contents. You need to list the nodes you want to use there. The command line will be something like this: mpirun -np 4 -machinefile my_machinefile canon "man mpirun"

[Beowulf] problem of mpich-1.2.7p1

2010-02-02 Thread christian suhendra
hello guys i have installed mpich-1.2.7p1 on ubuntu 9.04, i have configured hte NFS and RSH.. i use device=ch_p4,, but when i ran my program it's like not working i've got this result : r...@cluster3:/mirror/mpich-1.2.7p1# mpirun -np 1 canon Process 0 of 1 on cluster3 Total Time: 4.316000 msecs r..

Re: [Beowulf] Transient NFS Problems in New Cluster

2010-02-02 Thread Jon Forrest
On 2/2/2010 2:29 PM, Joe Landman wrote: We still recommend turning off autofs for home directories. We've seen lots of problems with it on many clusters. Hard mounts are IMO better. That server should be able to handle it. These problems were also happening for another non-home mount, but I he

Re: [Beowulf] Transient NFS Problems in New Cluster

2010-02-02 Thread Joe Landman
Jon Forrest wrote: I have a new cluster running CentOS 5.3. The cluster uses a Sun 7310 storage server that provides NFS service over a private 1Gb/s ethernet with 9K jumbo frames to the cluster. We've noticed that a number of the compute nodes sometimes generate the automount[15023]: umount_au

[Beowulf] Transient NFS Problems in New Cluster

2010-02-02 Thread Jon Forrest
I have a new cluster running CentOS 5.3. The cluster uses a Sun 7310 storage server that provides NFS service over a private 1Gb/s ethernet with 9K jumbo frames to the cluster. We've noticed that a number of the compute nodes sometimes generate the automount[15023]: umount_autofs_indirect: ask u

RE: [Beowulf] hardware question - which PSU for this?

2010-02-02 Thread Hearns, John
> > intel and amd seem to have canned all type of new cpu's. i read now > nehalem-ex is just 6 cores, no longer 8 and 2.26Ghz, Vincent, please can you provide a reference for this? My understanding is that Nehalem-EX will be available in eight core versions and a six core edition, which is attra

[Beowulf] hardware question - which PSU for this?

2010-02-02 Thread Vincent Diepeveen
hi, This seems ideal mainboard for beowulf clusters. built in infiniband it seems. http://cgi.ebay.com/Arima-AMD-Opteron-Quad-Core-Socket-F-3000-series- Server_W0QQitemZ390149471460QQcmdZViewItemQQptZCOMP_EN_Networking_Compon ents?hash=item5ad6b87ce4 they get offered regurarly and cheap.

[Beowulf] storage solution/investment strategy

2010-02-02 Thread tegner
Please excuse me, I forgot to put in the subject. Probably best to just disregard my previous post (content is the same). /jon This will boil down to a questions eventually, but I need to give some background first. We are a small group doing CFD, and when we several years ago realized that beowu