[Mauiusers] Two queues/classes, with one blocking all jobs

Åke Sandgren ake.sandgren at hpc2n.umu.se
Tue Jan 16 07:50:42 MST 2007


On Tue, 2007-01-16 at 09:40 -0500, Bill Wichser wrote:
> I have a cluster of Linux nodes with two queues/classes.  One is the ib 
> queue and the other the default.  Users must specify the #PBS -q ib to 
> get into the ib queue otherwise they go into this default queue.
> 
> For the sake of example, let's say that there are 64 nodes of IB and 64 
> nodes of GigE (default).
> 
> I define in the server_priv/nodes file the attribute of either ib or 
> noib to the hosts.  In my qmgr I define
> 
> set queue default resources_default.neednodes = noib
> set queue ib resources_default.neednodes = ib
> set queue ib resources_max.nodect = 64
> 
> In maui.cfg I have set
> 
> CLASSCFG[ib]    MAXPROC=128 MAXNODE=64
> 
> Now the problem.
> 
> As the ib queue starts running and jobs fill it up, wider jobs move 
> into the blocked state, as expected, because resources are not 
> available.  However, narrower jobs that still fit are then scheduled 
> ahead of the blocked wider jobs.  The wide jobs just sit there, never 
> moving to the idle state, and so never run until, say, two of the 
> narrower jobs happen to finish at the same time and free enough 
> resources at once.

What's your "RESERVATIONPOLICY"?

If it is not already set this way, try:
RESERVATIONPOLICY       CURRENTHIGHEST
RESERVATIONDEPTH        20

Or some other suitable value for the depth. Then tune your priority
values until you are satisfied.
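
To illustrate why a reservation helps here, below is a toy Python model.
It is only a sketch and not Maui's actual algorithm: the function name,
the 4-node cluster and the job sizes are all made up for the example. A
wide job needs every node, narrow 2-node jobs are always waiting, and the
two jobs running at the start finish at different times. Without a
reservation the freed nodes are refilled at once, so the wide job never
starts. With a reservation, a narrow job is only backfilled if it would
finish before the wide job's reserved start time.

```python
def wide_job_start(use_reservation, total_nodes=4, horizon=50):
    """Return the time step at which the wide job starts, or None if it starves.

    Toy model with hypothetical numbers; not Maui's real scheduling code.
    """
    wide_need = total_nodes           # the wide job needs every node
    narrow_need, narrow_len = 2, 2    # endless supply of 2-node, 2-step jobs
    running = [(2, 2), (3, 2)]        # (end_time, nodes) of jobs running at t=0

    for t in range(horizon):
        # Drop jobs that have finished and count the free nodes.
        running = [(end, n) for end, n in running if end > t]
        free = total_nodes - sum(n for _, n in running)
        if free >= wide_need:
            return t

        # Reservation: earliest time the wide job fits if nothing new starts.
        reserved_at = None
        if use_reservation:
            avail = free
            for end, n in sorted(running):
                avail += n
                if avail >= wide_need:
                    reserved_at = end
                    break

        # Start narrow jobs while they fit; with a reservation, only those
        # that finish no later than the reserved start time.
        while free >= narrow_need:
            if reserved_at is not None and t + narrow_len > reserved_at:
                break
            running.append((t + narrow_len, narrow_need))
            free -= narrow_need

    return None


print("no reservation:", wide_job_start(False))
print("with reservation:", wide_job_start(True))
```

In this model the wide job starves within the simulated horizon when no
reservation is made, and starts as soon as the second running job ends
when one is. RESERVATIONDEPTH sets how many of the highest-priority jobs
get such a reservation, which is why the priority values matter too.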

-- 
Ake Sandgren, HPC2N, Umea University, S-90187 Umea, Sweden
Internet: ake at hpc2n.umu.se   Phone: +46 90 7866134 Fax: +46 90 7866126
Mobile: +46 70 7716134 WWW: http://www.hpc2n.umu.se
