[torqueusers] Maui does not know queue to node map? - queue system is failing, please HELP !
Coyle, James J [ITACD]
jjc at iastate.edu
Fri Feb 3 11:39:01 MST 2012
This is the Torque mailing list, OpenPBS has not been
maintained in a long time.
I upgraded to Torque when OpenPBS stopped being supported
about 2004 if I recall correctly.
Torque is available from http://www.adaptivecomputing.com/products/torque.php
>From: torqueusers-bounces at supercluster.org [mailto:torqueusers-
>bounces at supercluster.org] On Behalf Of Milind
>Sent: Friday, February 03, 2012 11:57 AM
>To: torqueusers at supercluster.org
>Subject: [torqueusers] Maui does not know queue to node map? - queue
>system is failing, please HELP !
>I am a cluster administrator at
>the University of Wisconsin-Madison. At our cluster we have Maui
>(3.2.5), OpenPBS 2.3 on the ROCKS 5.3 system.
>For last few days, our queue system has been haywire : the PBS
>accepts jobs and puts them in right queues, but the scheduler
>somehow does something in the middle, and the job ends up on a
>'wrong' compute node (which is not supposed to be in that queue),
>all the while PBS still lists that job as running under the right
>example, PBS shows this:
>Job id Name User Time Use
>------------------------- ---------------- --------------- --------
>60606.bardeen Cu1_a60_mov <user> 00:52:05 R
>but the job is on a compute node which is not at all in the queue
>"fast" ! The pbs nodelist (/opt/torque/server_priv/nodes ) is all
>fine, no errors in maui logs.
>In pbs logs, I get this message
>Modified at request of maui at bardeen.msae.wisc.edu
>My guess is that maui is doing something wrong / it does not know
>the correct queue - to - node mapping.
>Can someone suggest what is going on or guide me to solve this issue
>torqueusers mailing list
>torqueusers at supercluster.org
More information about the torqueusers