Re: [Beowulf] cluster scheduler implemented via MPI?

2010-05-15 Thread Peter St. John
I just want to remark that I'm curious about peer-to-peer scheduling, where nodes would negotiate among each other according to priorities, needs, and idle resources.
Peter
On Sat, May 15, 2010 at 9:01 AM, Andrew Piskorski wrote:
> A cluster scheduler (or resource manager) like SGE, SLURM, or To

Re: [Beowulf] cluster scheduler for dynamic tree-structured jobs?

2010-05-15 Thread Skylar Thompson
On 05/15/10 08:44, Andrew Piskorski wrote: SGE does this and can make it available as XML. Which reminds me, I need to look harder to figure out WHERE exactly SGE stores its node configuration data, and how I can perhaps extend it with additional information, like the network topology betw

Re: [Beowulf] cluster scheduler for dynamic tree-structured jobs?

2010-05-15 Thread Rayson Ho
On Sat, May 15, 2010 at 10:44 AM, Andrew Piskorski wrote:
> Yes, that's what I did with SGE, that part works fine. SGE's other
> behaviors often leave much to be desired.
Just because the default settings of SGE do not follow your workflow does not mean that "SGE's other behaviors often leave mu

Re: [Beowulf] cluster scheduler for dynamic tree-structured jobs?

2010-05-15 Thread Andrew Piskorski
On Sat, May 15, 2010 at 07:33:08AM -0700, Skylar Thompson wrote:
> I'm not quite sure I understand what you're doing, but if you make all
> your execution hosts submit hosts as well you can submit jobs within
> your running jobs. You can use "-now y -sync y" in your jobs to ensure
Yes, that's wha

Re: [Beowulf] cluster scheduler for dynamic tree-structured jobs?

2010-05-15 Thread Skylar Thompson
On 05/15/10 03:24, Andrew Piskorski wrote: Folks, I could use some advice on which cluster job scheduler (batch queuing system) would be most appropriate for my particular needs. I've looked through docs for SGE, Slurm, etc., but without first-hand experience with each one it's not at all clear t

[Beowulf] cluster scheduler implemented via MPI?

2010-05-15 Thread Andrew Piskorski
A cluster scheduler (or resource manager) like SGE, SLURM, or Torque can itself be viewed as a parallel application running on a cluster. So I'm wondering, have any such schedulers been implemented *as* an MPI program? (Or PVM or whatever else?) If not, why not? Does the MPI programming environm

[Beowulf] Re: cluster scheduler for dynamic tree-structured jobs?

2010-05-15 Thread Andrew Piskorski
On Sat, May 15, 2010 at 06:24:54AM -0400, Andrew Piskorski wrote:
> 1. I have lots of embarrassingly parallel tree-structured jobs which I
> dynamically generate and submit from top-level user code (which
> happens to be written in R). E.g., my user code generates 10 or 100
> or 1000 jobs, and ea

[Beowulf] cluster scheduler for dynamic tree-structured jobs?

2010-05-15 Thread Andrew Piskorski
Folks, I could use some advice on which cluster job scheduler (batch queuing system) would be most appropriate for my particular needs. I've looked through docs for SGE, Slurm, etc., but without first-hand experience with each one it's not at all clear to me which I should choose... I've used Sun