On Mon, 8 Mar 2010 10:39:08 -0500 Glen Beane <[email protected]> wrote:
> > > > On 3/8/10 10:14 AM, "Micha Feigin" <[email protected]> wrote: > > I have a small local cluster in our lab that I'm trying to setup with minimum > hustle to support both cpu and gpu processing where only some of the nodes > have > a gpu and those have only two gpu for four cores. > > It is currently setup using torque from ubuntu (2.3.6) with the torque > supplied > scheduler (set it up with maui initially but it was a bit of a pain for such a > small cluster so I switched) > > This cluster is used by very few people in a very controlled environment so I > don't really need any protection from each other, the queues are just for > convenience to allow remote execution > > The problem: > > I want to allow gpu related jobs to run only on the gpu equiped nodes (i.e > more jobs then GPUs will be queued), I want to run other jobs on all nodes > with either > 1. a priority to use the gpu equiped nodes last > 2. or better, use only two out of four cores on the gpu equiped nodes > > It doesn't seem though that I can map nodes or cores to queues with torque as > far as I can tell > (i.e cpu queue uses 2 cores on gpu1, 2 cores on gpu2, all cores on everything > else > gpu queue uses 2 cores on gpu1, 2 cores on gpu2) > > I can't seem to set user defined resources so that I can define gpu machines > as having gpu resource and schedule according to that. > > Is it possible to achieve any of these two with torque, or is there any other > simple enough queue manager that can do this (preferably with a debian package > in some way to simplify maintanance). I only manage this cluster since no one > else knows how to and it's supposed to take as little of my time as possible > I'm looking for the simplest solution to implement and not the most versatile > one. > > > you can define a resource "gpu" in your TORQUE nodes file: > > hostname np=4 gpu > > and then users can request -l nodes=1:ppn=4:gpu to get assigned a node with a > gpu, but to do anything more advanced you'll need Maui or Moab. You should > try the maui users mailing list, or the torque users mailing list to see if > anyone else has some ideas Thanks, almost perfect. It would have been a complete solution if there was a way to define how many such resources there are as there are 4 cores and 2 GPUs per node. Its good enough for now though as it works perfect when asking for nodes=1:ppn=2 to make sure that I don't get too many GPU jobs. This is a cluster that is used by 3 people that are cooperating at the moment so I can waste the extra core for now to spare man hours for the setup of maui. _______________________________________________ Beowulf mailing list, [email protected] sponsored by Penguin Computing To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf
