[Mauiusers] Jobs in Queue Forever

C. D. Poon cdpoon at unc.edu
Tue Nov 2 08:44:40 MST 2004


I am testing OpenPBS 2.3.16 with Maui 3.2.6p9 and have found a problem.  
When I have some jobs waiting in the queue, there will always be some 
which will stay in the queue forever.  Those jobs will never get to 
start even though there are execution hosts available.  All hosts are 
running RedHat Enterprise Linux 3.0 with the latest kernel 2.4.21-20.  
The OpenPBS server log gives the following lines for jobs 503 and 504 as 
examples.

11/02/2004 10:11:02;0008;PBS_Server;Job;503.topaz.isis.unc.edu;Job 
Modified at request of root at topaz.isis.unc.edu
11/02/2004 10:11:02;0100;PBS_Server;Req;;Type 15 request received from 
root at topaz.isis.unc.edu, sock=9
11/02/2004 10:11:02;0080;PBS_Server;Req;req_reject;Reject reply 
code=15044, aux=0, type=15, from root at topaz.isis.unc.edu
11/02/2004 10:11:02;0100;PBS_Server;Req;;Type 11 request received from 
root at topaz.isis.unc.edu, sock=9
11/02/2004 10:11:02;0008;PBS_Server;Job;504.topaz.isis.unc.edu;Job 
Modified at request of root at topaz.isis.unc.edu
11/02/2004 10:11:02;0100;PBS_Server;Req;;Type 15 request received from 
root at topaz.isis.unc.edu, sock=9
11/02/2004 10:11:02;0080;PBS_Server;Req;req_reject;Reject reply 
code=15044, aux=0, type=15, from root at topaz.isis.unc.edu

In the Maui log, I get the following lines.

11/02 10:11:02 ERROR:    job '503' cannot be started: (rc: 15044  
errmsg: 'Resource temporarily unavailable'  hostlist: 
'bc02-n07.isis.unc.edu')
11/02 10:11:02 ERROR:    job '504' cannot be started: (rc: 15044  
errmsg: 'Resource temporarily unavailable'  hostlist: 
'bc02-n07.isis.unc.edu')

Has anyone seen this problem before?  Is it OpenPBS problem or Maui or a 
combination of both?  Would it be a problem with configuration in either 
OpenPBS or Maui?

Thanks,

CD Poon



More information about the mauiusers mailing list