[torqueusers] Possible bug with auto_node_np

J.A. Magallón jamagallon at ono.com
Thu Dec 2 18:04:35 MST 2010


Hi...

Better the demo that any explanation:

bran:~/mpi> qmgr -c 'p n n0.mpi' | grep "np ="
set node n0.mpi np = 2
bran:~/mpi> qmgr -c 'p n n1.mpi' | grep "np ="
set node n1.mpi np = 2
bran:~/mpi> qsub -l nodes=2:ppn=2 k
qsub: Job exceeds queue resource limits MSG=cannot locate feasible nodes
bran:~/mpi> qmgr -c 's n n0.mpi np = 2'
bran:~/mpi> qmgr -c 'p n n0.mpi' | grep "np ="
set node n0.mpi np = 2
bran:~/mpi> qsub -l nodes=2:ppn=2 k
2.master.mpi
bran:~/mpi> qstat -n

annwn.cps.unizar.es:
                                                                         Req'd  Req'd   Elap
Job ID               Username Queue    Jobname          SessID NDS   TSK Memory Time  S Time
-------------------- -------- -------- ---------------- ------ ----- --- ------ ----- - -----
2.master.mpi         magallon std      x                   --      2   4    --    --  R   --
   n1+n1+n0+n0

First time submission fails, i re-set the same value (and only for one node)
and then it works. Weird...

Server was set with auto_node_np.

Any ideas ?

-- 
J.A. Magallon <jamagallon()ono!com>     \               Software is like sex:
                                         \         It's better when it's free


More information about the torqueusers mailing list