[Mauiusers] policy confusion

Caird, Andrew J acaird at umich.edu
Thu Nov 9 09:44:09 MST 2006


What does 
      checkjob 5846
say?  Usually it will tell you why a job is not running.
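
If the plain output is not conclusive, the sketch below shows one way to dig further. It assumes the Maui client commands are on your PATH and that you run it on the host where Maui lives; the -v flag and the diagnose -q call are additions to the suggestion above, and 5846 is simply the job ID from your qstat listing.

```shell
# Minimal sketch: ask Maui why a queued job is not starting.
# Assumption: Maui client commands (checkjob, diagnose) are installed and on PATH.
jobid=5846   # job ID taken from the qstat listing in the original message

if command -v checkjob >/dev/null 2>&1; then
    # Verbose job check: shows job state and any reason Maui reports for not starting it.
    checkjob -v "$jobid"
    # Queue diagnostics: reports queued jobs that Maui considers blocked.
    diagnose -q
else
    echo "checkjob not found; run this on the Maui server host" >&2
fi
```

Comparing any limit named in that output against the USERCFG, GROUPCFG and CLASSCFG lines in maui.cfg is usually the quickest way to find the policy that is holding the job.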

--andy

> -----Original Message-----
> From: mauiusers-bounces at supercluster.org [mailto:mauiusers-bounces at supercluster.org] On Behalf Of Paul Van Allsburg
> Sent: Thursday, November 09, 2006 11:35 AM
> To: mauiusers at supercluster.org
> Subject: [Mauiusers] policy confusion
> 
> I have what seems to be a simple policy, but my job is stuck 
> in the queue and I don't know why.  The cluster is 16 
> nodes/32 processors, and I have 4 queues; 'normal' is the 
> default.  This is the cluster's current status:
> 
> Job id           Name             User             Time Use S Queue
> ---------------- ---------------- ---------------- -------- - -----
> 5805.curie       ...o1-7imp-md123 hinkle           13:37:24 R normal
> 5836.curie       ...o1-2imp-md125 hinkle                  0 Q normal
> 5837.curie       ...o1-9imp-md136 hinkle                  0 Q normal
> 5846.curie       cpuburn          vanallp                 0 Q normal
> 
> I have hinkle limited to 4 processors, and job 5805 is using 
> all 4.  I submitted cpuburn to a single node, but it's not running. 
> My maui.cfg is:
> 
> # maui.cfg 3.2p8
> RMCFG[base] TYPE=PBS
> RMPOLLINTERVAL        00:02:00
> SERVERPORT            42559
> SERVERMODE            NORMAL
> LOGFILE               maui.log
> LOGFILEMAXSIZE        10000000
> LOGLEVEL              3
> QUEUETIMEWEIGHT       1
> BACKFILLPOLICY        FIRSTFIT
> RESERVATIONPOLICY     CURRENTHIGHEST
> NODEALLOCATIONPOLICY  CPULOAD
>  
> CREDWEIGHT            1
> USERWEIGHT            1
> GROUPWEIGHT           1
> CLASSWEIGHT           1
> 
> USERCFG[vanallp]      MAXNODE=2
> USERCFG[hinkle]       MAXPROC=4
> USERCFG[webmo]        MAXNODE=4 PRIORITY=100000
> USERCFG[DEFAULT]      MAXNODE=9
> GROUPCFG[DEFAULT]     MAXNODE=11
> 
> # these are the 4 queues
> CLASSCFG[webmoq]      PRIORITY=1000000
> CLASSCFG[normal]      MAXNODE=14
> CLASSCFG[debug]       MAXNODE=15
> CLASSCFG[admin]       MAXNODE=16
> 
> XFACTOR               1
> # this parm gives short wall clock jobs priority
> # limited to 1 day... see 5.1.2.5 in Maui admin guide:)
> #                     one day!    
> XFMINWCLIMIT          1440
> 
> #<eof>
> Am I missing the obvious? 
> Thanks!
> Paul Van Allsburg
> 
> 
> _______________________________________________
> mauiusers mailing list
> mauiusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/mauiusers
> 

