[Mauiusers] maui doesn't start jobs

Anne M. Hammond hammond at txcorp.com
Wed Feb 28 20:57:43 MST 2007


My job just remains queued:

[hammond@storage3 log]$ qstat -a

                                                                          Req'd   Elap
Job ID               Username Queue    Jobname    SessID NDS   TSK Memory Time  S Time
-------------------- -------- -------- ---------- ------ ----- --- ------ ----- - -----
86.storage3.cl.txcor swsides  s3opt8   a059104403  20584     5   2 4000mb 24:00 R 05:27
93.storage3.cl.txcor hammond  s3opt12  s.sh          --      1   2 4000mb 24:00 Q   --

-------
Maui version 3.2.6p19
torque-2.1.6
-------

The setup is pretty much out of the box.

Except for these changes:
# maui.cfg 3.2.6p19

SERVERHOST            storage3.cl.txcorp.com
# primary admin must be first in list
ADMIN1                hammond

# Resource Manager Definition
#
#RMCFG[STORAGE3.CL.TXCORP.COM] TYPE=PBS@RMNMHOST@
RMCFG[STORAGE3.CL.TXCORP.COM] TYPE=PBS
# ALSO TRIED
#RMCFG[BASE] TYPE=PBS

---------

The queue is defined as follows:
# Create and define queue s3opt12
#
create queue s3opt12
set queue s3opt12 queue_type = Execution
set queue s3opt12 Priority = 30
set queue s3opt12 max_running = 1
set queue s3opt12 acl_host_enable = False
set queue s3opt12 resources_max.nodect = 12
set queue s3opt12 resources_max.walltime = 24:00:00
set queue s3opt12 resources_default.nodect = 1
set queue s3opt12 max_user_run = 1
set queue s3opt12 enabled = True
set queue s3opt12 started = True

The pbs server manager is hammond@{fully qualified hostname}

maui is running as hammond

torque qmgr has "set server scheduling=True"
torque pbs_sched daemon is not started.

[hammond@storage3 log]$ /usr/local/maui/bin/checkjob -v 93


checking job 93 (RM job '93.storage3.cl.txcorp.com')

State: Idle  EState: Deferred
Creds:  user:hammond  group:admin  class:s3opt12  qos:DEFAULT
WallTime: 00:00:00 of 1:00:00:00
SubmitTime: Wed Feb 28 20:17:35
   (Time Queued  Total: 00:27:04  Eligible: 00:00:00)

Total Tasks: 1

Req[0]  TaskCount: 1  Partition: ALL
Network: [NONE]  Memory >= 0  Disk >= 0  Swap >= 0
Opsys: [NONE]  Arch: [NONE]  Features: [NONE]
Exec:  ''  ExecSize: 0  ImageSize: 0
Dedicated Resources Per Task: PROCS: 2  MEM: 4000M
NodeAccess: SHARED
NodeCount: 1


IWD: [NONE]  Executable:  [NONE]
Bypass: 0  StartCount: 0
PartitionMask: [ALL]
SystemQueueTime: Wed Feb 28 20:42:31

Flags:       RESTARTABLE

job is deferred.  Reason:  NoResources  (cannot create reservation for job '93' (intital reservation attempt)
)
Holds:    Defer  (hold reason:  NoResources)
PE:  2.02  StartPriority:  2
cannot select job 93 for partition DEFAULT (job hold active)
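
The defer reason can be pulled out of long checkjob output on its own. The following is a minimal sketch in shell, assuming the "job is deferred.  Reason:" line has the form shown above; defer_reason is a hypothetical helper name, not part of Maui.

```shell
# Print the defer reason found in `checkjob -v` output read from stdin.
# defer_reason is a hypothetical helper, not a Maui command.
defer_reason() {
    # Match "job is deferred.  Reason:  <Word>" and keep only <Word>.
    sed -n 's/^job is deferred\.  *Reason:  *\([A-Za-z]*\).*/\1/p'
}
```

Usage would look like this:

    /usr/local/maui/bin/checkjob -v 93 | defer_reason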

------

pbsnodes -a shows 9 nodes free.
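
A "free" state alone does not show how many processors each node offers, and checkjob lists "PROCS: 2" per task for this job. The following is a minimal sketch in shell for listing the np value of every free node. It assumes the usual pbsnodes -a layout, where each node record starts with an unindented node name followed by indented "key = value" lines with state before np; free_node_procs is a hypothetical helper name.

```shell
# Read `pbsnodes -a` output on stdin and print "<node> <np>" for free nodes.
# free_node_procs is a hypothetical helper, not a Torque command.
free_node_procs() {
    awk '
        /^[^ \t]/     { node = $1 }    # unindented line starts a node record
        $1 == "state" { state = $3 }   # e.g. "     state = free"
        $1 == "np"    { if (state == "free") print node, $3 }
    '
}
```

Usage would look like this:

    pbsnodes -a | free_node_procs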

------

Any pointers much appreciated.

Anne



Anne M. Hammond - Systems / Network Administration - Tech-X Corp
                   hammond_at_txcorp.com 720-974-1840

