[torqueusers] Torque + Maui Configuration

Tomás Soares tomas at lsd.ufcg.edu.br
Thu Oct 22 07:24:00 MDT 2009


Hello All,

I have 9 nodes with torque and maui installed and I'm using this 
versions: torque-2.3.0 and maui-3.2.6p20.
I was trying to submit a simple job using "qsub -q prod script.sh" but 
when I run qstat the jobs status is switching between Queued, E and 
Running but don't return an output and never goes out of the qstat.
Someone suspects whats wrong?

Thanks a lot!!!

The script.sh is:

#!/bin/bash
hostname -f
date

A tail -100 in the pbs logs:

10/22/2009 10:19:21;0100;PBS_Server;Req;;Type StatusJob request received 
from pbs_mom at WN12.my.host.name, sock=12
10/22/2009 10:19:21;0100;PBS_Server;Req;;Type JobObituary request 
received from pbs_mom at WN12.my.host.name, sock=12
10/22/2009 10:19:21;0040;PBS_Server;Svr;server1.my.host.name;Scheduler 
sent command new
10/22/2009 10:19:22;0100;PBS_Server;Req;;Type StatusNode request 
received from root at 32.55.125.0, sock=11
10/22/2009 10:19:22;0100;PBS_Server;Req;;Type StatusQueue request 
received from root at 32.55.125.0, sock=11
10/22/2009 10:19:22;0100;PBS_Server;Req;;Type StatusJob request received 
from root at 32.55.125.0, sock=11
10/22/2009 10:19:22;0100;PBS_Server;Req;;Type ModifyJob request received 
from root at 32.55.125.0, sock=11
10/22/2009 10:19:22;0008;PBS_Server;Job;6646.server1.my.host.name;Job 
Modified at request of root at server1.my.host.name
10/22/2009 10:19:22;0100;PBS_Server;Req;;Type RunJob request received 
from root at 32.55.125.0, sock=11
10/22/2009 10:19:22;0008;PBS_Server;Job;6646.server1.my.host.name;Job 
Run at request of root at server1.my.host.name
10/22/2009 10:19:22;0100;PBS_Server;Req;;Type ModifyJob request received 
from root at 32.55.125.0, sock=11
10/22/2009 10:19:22;0008;PBS_Server;Job;6647.server1.my.host.name;Job 
Modified at request of root at server1.my.host.name
10/22/2009 10:19:22;0100;PBS_Server;Req;;Type RunJob request received 
from root at 32.55.125.0, sock=11
10/22/2009 10:19:22;0008;PBS_Server;Job;6647.server1.my.host.name;Job 
Run at request of root at server1.my.host.name
10/22/2009 10:19:22;0100;PBS_Server;Req;;Type StatusJob request received 
from pbs_mom at WN6.my.host.name, sock=13

Bellow, a little part of the maui log:

10/22 10:15:16 MJobPReserve(6624,DEFAULT,ResCount,ResCountRej)
10/22 10:15:16 MJobPReserve(6615,DEFAULT,ResCount,ResCountRej)
10/22 10:15:16 MJobPReserve(6618,DEFAULT,ResCount,ResCountRej)
10/22 10:15:16 MJobPReserve(6622,DEFAULT,ResCount,ResCountRej)
10/22 10:15:16 MJobPReserve(6626,DEFAULT,ResCount,ResCountRej)
10/22 10:15:16 MJobPReserve(6629,DEFAULT,ResCount,ResCountRej)
10/22 10:15:16 MJobPReserve(6630,DEFAULT,ResCount,ResCountRej)
10/22 10:15:16 MJobPReserve(6636,DEFAULT,ResCount,ResCountRej)
10/22 10:15:16 INFO:     total jobs selected in partition ALL: 23/27 
[State: 4]
10/22 10:15:16 INFO:     total jobs selected in partition DEFAULT: 23/23
10/22 10:15:16 MQueueBackFill(BFQueue,HARD,DEFAULT)
10/22 10:15:16 INFO:     total jobs selected in partition ALL: 23/27 
[State: 4]
10/22 10:15:16 INFO:     current util[1043]:  9/9 (100.00%)  PH: 18.51%  
active jobs: 9 of 59 (completed: 2561)
10/22 10:15:16 INFO:     scheduling complete.  sleeping 10 seconds


More information about the torqueusers mailing list