I just want to remark that I'm curious about peer-to-peer scheduling, where
nodes would negotiate among each other according to priorities, needs, and
idle resources.
Peter
On Sat, May 15, 2010 at 9:01 AM, Andrew Piskorski wrote:
> A cluster scheduler (or resource manager) like SGE, SLURM, or To
On 05/15/10 08:44, Andrew Piskorski wrote:
SGE does this and can make it available as XML.
Which reminds me, I need to look harder to figure out WHERE exactly
SGE stores its node configuration data, and how I can perhaps extend
it with additional information, like the network topology betw
On Sat, May 15, 2010 at 10:44 AM, Andrew Piskorski wrote:
> Yes, that's what I did with SGE, that part works fine. SGE's other
> behaviors often leave much to be desired.
Just because the default settings of SGE do not follow your workflow
does not mean that "SGE's other behaviors often leave mu
On Sat, May 15, 2010 at 07:33:08AM -0700, Skylar Thompson wrote:
> I'm not quite sure I understand what you're doing, but if you make all
> your execution hosts submit hosts as well you can submit jobs within
> your running jobs. You can use "-now y -sync y" in your jobs to ensure
Yes, that's wha
On 05/15/10 03:24, Andrew Piskorski wrote:
Folks, I could use some advice on which cluster job scheduler (batch
queuing system) would be most appropriate for my particular needs.
I've looked through docs for SGE, Slurm, etc., but without first-hand
experience with each one it's not at all clear t
A cluster scheduler (or resource manager) like SGE, SLURM, or Torque
can itself be viewed as a parallel application running on a cluster.
So I'm wondering, have any such schedulers been implemented *as* an
MPI program? (Or PVM or whatever else?) If not, why not? Does the
MPI programming environm
On Sat, May 15, 2010 at 06:24:54AM -0400, Andrew Piskorski wrote:
> 1. I have lots of embarrassingly parallel tree-structured jobs which I
> dynamically generate and submit from top-level user code (which
> happens to be written in R). E.g., my user code generates 10 or 100
> or 1000 jobs, and ea
Folks, I could use some advice on which cluster job scheduler (batch
queuing system) would be most appropriate for my particular needs.
I've looked through docs for SGE, Slurm, etc., but without first-hand
experience with each one it's not at all clear to me which I should
choose...
I've used Sun