[torqueusers] routing queue not assigning jobs as per expected
sduchene at sgi.com
Fri Sep 28 15:03:53 MDT 2007
I am struggling with a mixed-architecture setup in which some nodes are
x86_64 and another is an ia64 shared-memory system. I have set a
property of either x86_64 or ia64 on each node, and I have two execution
queues, one for each group of nodes.
I also have a routing queue set up as the default queue.
I set the node properties using:
qmgr -c "set node oscarnode1 properties += x86_64"
qmgr -c "set node oscar_a450 properties += ia64"
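A quick way to double-check that the properties took is to list the nodes
back out of qmgr. This is only a sketch: it assumes the list node
directive behaves the same way in this TORQUE version and that its output
includes a properties line for each node.

```
# qmgr directives (sketch): show each node's attributes, including properties
list node oscarnode1
list node oscar_a450
```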
When I submit jobs I do:
qsub -l nodes=1:ia64 myjob_ia64.pbs
qsub -l nodes=1:x86_64 myjob_x86_64.pbs
where both of these job scripts make a very simple call to hostname.
Whichever of the above jobs I submit, it ends up in the x86_64 queue
(batchx86). A job asking for x86_64 nodes runs right through and I get
the expected output. A job asking for ia64 nodes is still sent to the
x86_64 queue, where it stalls because that queue has no resources with
the ia64 property. I thought this might be a problem with my Moab
configuration, but Doug Wightman from CRI said my routing queue might
not be set up properly and suggested I ask about it here on the
torqueusers mailing list.
My qmgr -c "print server" output looks like the following:
# Create queues and set their attributes.
# Create and define queue batchx86
create queue batchx86
set queue batchx86 queue_type = Execution
set queue batchx86 acl_host_enable = False
set queue batchx86 acl_hosts = oscarnode4
set queue batchx86 acl_hosts += oscarnode3
set queue batchx86 acl_hosts += oscarnode2
set queue batchx86 acl_hosts += oscarnode1
set queue batchx86 resources_default.walltime = 01:00:00
set queue batchx86 enabled = True
set queue batchx86 started = True
# Create and define queue batchia64
create queue batchia64
set queue batchia64 queue_type = Execution
set queue batchia64 acl_host_enable = False
set queue batchia64 acl_hosts = oscar_a450
set queue batchia64 resources_default.walltime = 01:00:00
set queue batchia64 enabled = True
set queue batchia64 started = True
# Create and define queue route
create queue route
set queue route queue_type = Route
set queue route route_destinations = batchx86
set queue route route_destinations += batchia64
set queue route enabled = True
set queue route started = True
# Set server attributes.
set server default_queue = route
set server log_events = 511
set server mail_from = adm
set server scheduler_iteration = 600
set server node_check_rate = 150
set server tcp_timeout = 6
set server pbs_version = 2.1.9
Does this look like it should send jobs asking for nodes with the ia64
property to the execution queue that has those resources?
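For reference, here is a sketch of a variation I have not applied. It
assumes a routing queue hands each job to the first destination that will
accept it (which would explain everything landing in batchx86, since that
queue has no limits set and acl_host_enable is False), and it assumes
resources_default.neednodes is honored in this TORQUE/Moab combination:

```
# Sketch only, not applied: give each execution queue a default node property
set queue batchx86 resources_default.neednodes = x86_64
set queue batchia64 resources_default.neednodes = ia64
```

As far as I can tell this would only tie each execution queue to its own
nodes. It would not make the route queue pick a destination by
architecture, so jobs would presumably still need to be submitted straight
to a queue with qsub -q.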
Any hints or informational pointers would be most appreciated.
Steven A. DuChene