Re: [Beowulf] first cluster[B

2010-07-09 Thread Mark Hahn
Debian (lenny) why? centos is generally considered the safest choice, unless you're religiously committed to debian. Almost religiously. I have found it a very stable platform for everything up to clusters. OK. you should know that the stability comes from linux itself and the underlying

Re: [Beowulf] first cluster [was [OMPI users] trouble using openmpi under slurm]

2010-07-09 Thread Gus Correa
Douglas Guptill wrote: On Thu, Jul 08, 2010 at 09:43:48AM -0400, Gus Correa wrote: Douglas Guptill wrote: On Wed, Jul 07, 2010 at 12:37:54PM -0600, Ralph Castain wrote: Noafraid not. Things work pretty well, but there are places where things just don't mesh. Sub-node allocation in particu

[Beowulf] Question about maui scheduler and reservations on a node: logical AND or OR

2010-07-09 Thread Rahul Nabar
If there are twin reservations set for the same timespan on a certain node do they get ANDed or ORed? setres -u userfoo -s '+5' -d '10:00:00' node1 setres -u userbar -s '+5' -d '10:00:00' node1 Will userfoo have access to the node or userbar or neither? Or is it the first reservation that is alwa

Re: [Beowulf] first cluster

2010-07-09 Thread Douglas Guptill
On Fri, Jul 09, 2010 at 02:19:53PM -0400, Mark Hahn wrote: >> Debian (lenny) > > why? centos is generally considered the safest choice, > unless you're religiously committed to debian. Almost religiously. I have found it a very stable platform for everything up to clusters. If Debian fails to

[Beowulf] first cluster [was [OMPI users] trouble using openmpi under slurm]

2010-07-09 Thread Douglas Guptill
On Thu, Jul 08, 2010 at 09:43:48AM -0400, Gus Correa wrote: > Douglas Guptill wrote: >> On Wed, Jul 07, 2010 at 12:37:54PM -0600, Ralph Castain wrote: >> >>> Noafraid not. Things work pretty well, but there are places >>> where things just don't mesh. Sub-node allocation in particular is >>> an

Re: [Beowulf] diskless cluster questions

2010-07-09 Thread Vallard Benincosa
With diskless clusters you also need to be aware of the many ways to do it: - RAM root - where all of the OS is loaded in memory - NFS root - which is what a lot of people seem to call diskless - RamRoot/NFS root hybrid - where some directories like /root live on RAM and /usr lives on NFS for exam

Re: [Beowulf] diskless cluster questions

2010-07-09 Thread Steve Crusan
> then once you power them off if im not mistaken any data is sent back to the master for storage You should lose all of your changes if your OS is kept in RAM once you power off the node, reboot it, etc. On 7/7/10 10:33 AM, "Jonathan Aquilina" wrote: > > its actually easier if you use one os

Re: [Beowulf] diskless cluster questions

2010-07-09 Thread Andrew Latham
Tools like DHCP can manage information for individual nodes as a server. The nodes can be identified by the network card MAC address. Things like IP, Netmask, Router, DNS, Hostname and other options can be set per MAC identifier. A great example is the mass deployment of VoIP Hardphones that ask

[Beowulf] shutting down pbs server and maui for half an hour will affect running jobs?

2010-07-09 Thread akshar bhosale
hi, we have maintenance of pbs server so it is going down for half an hour ..will it affect running jobs?where is the timeout defined?can it be increased? on pbs mom side or pbs server side we need to change?any other parameter we need to check ?will it hold the already running jobs for half an hou