[Mauiusers] Jobs in Queue Forever
C. D. Poon
cdpoon at unc.edu
Tue Nov 2 08:44:40 MST 2004
I am testing OpenPBS 2.3.16 with Maui 3.2.6p9 and have found a problem.
When I have some jobs waiting in the queue, there will always be some
which will stay in the queue forever. Those jobs will never get to
start even though there are execution hosts available. All hosts are
running RedHat Enterprise Linux 3.0 with the latest kernel 2.4.21-20.
The OpenPBS server log gives the following lines for jobs 503 and 504 as
examples.
11/02/2004 10:11:02;0008;PBS_Server;Job;503.topaz.isis.unc.edu;Job
Modified at request of root at topaz.isis.unc.edu
11/02/2004 10:11:02;0100;PBS_Server;Req;;Type 15 request received from
root at topaz.isis.unc.edu, sock=9
11/02/2004 10:11:02;0080;PBS_Server;Req;req_reject;Reject reply
code=15044, aux=0, type=15, from root at topaz.isis.unc.edu
11/02/2004 10:11:02;0100;PBS_Server;Req;;Type 11 request received from
root at topaz.isis.unc.edu, sock=9
11/02/2004 10:11:02;0008;PBS_Server;Job;504.topaz.isis.unc.edu;Job
Modified at request of root at topaz.isis.unc.edu
11/02/2004 10:11:02;0100;PBS_Server;Req;;Type 15 request received from
root at topaz.isis.unc.edu, sock=9
11/02/2004 10:11:02;0080;PBS_Server;Req;req_reject;Reject reply
code=15044, aux=0, type=15, from root at topaz.isis.unc.edu
In the Maui log, I get the following lines.
11/02 10:11:02 ERROR: job '503' cannot be started: (rc: 15044
errmsg: 'Resource temporarily unavailable' hostlist:
'bc02-n07.isis.unc.edu')
11/02 10:11:02 ERROR: job '504' cannot be started: (rc: 15044
errmsg: 'Resource temporarily unavailable' hostlist:
'bc02-n07.isis.unc.edu')
Has anyone seen this problem before? Is it OpenPBS problem or Maui or a
combination of both? Would it be a problem with configuration in either
OpenPBS or Maui?
Thanks,
CD Poon
More information about the mauiusers
mailing list