[torqueusers] Torque + Maui Configuration
Tomás Soares
tomas at lsd.ufcg.edu.br
Thu Oct 22 07:24:00 MDT 2009
Hello All,
I have 9 nodes with torque and maui installed and I'm using this
versions: torque-2.3.0 and maui-3.2.6p20.
I was trying to submit a simple job using "qsub -q prod script.sh" but
when I run qstat the jobs status is switching between Queued, E and
Running but don't return an output and never goes out of the qstat.
Someone suspects whats wrong?
Thanks a lot!!!
The script.sh is:
#!/bin/bash
hostname -f
date
A tail -100 in the pbs logs:
10/22/2009 10:19:21;0100;PBS_Server;Req;;Type StatusJob request received
from pbs_mom at WN12.my.host.name, sock=12
10/22/2009 10:19:21;0100;PBS_Server;Req;;Type JobObituary request
received from pbs_mom at WN12.my.host.name, sock=12
10/22/2009 10:19:21;0040;PBS_Server;Svr;server1.my.host.name;Scheduler
sent command new
10/22/2009 10:19:22;0100;PBS_Server;Req;;Type StatusNode request
received from root at 32.55.125.0, sock=11
10/22/2009 10:19:22;0100;PBS_Server;Req;;Type StatusQueue request
received from root at 32.55.125.0, sock=11
10/22/2009 10:19:22;0100;PBS_Server;Req;;Type StatusJob request received
from root at 32.55.125.0, sock=11
10/22/2009 10:19:22;0100;PBS_Server;Req;;Type ModifyJob request received
from root at 32.55.125.0, sock=11
10/22/2009 10:19:22;0008;PBS_Server;Job;6646.server1.my.host.name;Job
Modified at request of root at server1.my.host.name
10/22/2009 10:19:22;0100;PBS_Server;Req;;Type RunJob request received
from root at 32.55.125.0, sock=11
10/22/2009 10:19:22;0008;PBS_Server;Job;6646.server1.my.host.name;Job
Run at request of root at server1.my.host.name
10/22/2009 10:19:22;0100;PBS_Server;Req;;Type ModifyJob request received
from root at 32.55.125.0, sock=11
10/22/2009 10:19:22;0008;PBS_Server;Job;6647.server1.my.host.name;Job
Modified at request of root at server1.my.host.name
10/22/2009 10:19:22;0100;PBS_Server;Req;;Type RunJob request received
from root at 32.55.125.0, sock=11
10/22/2009 10:19:22;0008;PBS_Server;Job;6647.server1.my.host.name;Job
Run at request of root at server1.my.host.name
10/22/2009 10:19:22;0100;PBS_Server;Req;;Type StatusJob request received
from pbs_mom at WN6.my.host.name, sock=13
Bellow, a little part of the maui log:
10/22 10:15:16 MJobPReserve(6624,DEFAULT,ResCount,ResCountRej)
10/22 10:15:16 MJobPReserve(6615,DEFAULT,ResCount,ResCountRej)
10/22 10:15:16 MJobPReserve(6618,DEFAULT,ResCount,ResCountRej)
10/22 10:15:16 MJobPReserve(6622,DEFAULT,ResCount,ResCountRej)
10/22 10:15:16 MJobPReserve(6626,DEFAULT,ResCount,ResCountRej)
10/22 10:15:16 MJobPReserve(6629,DEFAULT,ResCount,ResCountRej)
10/22 10:15:16 MJobPReserve(6630,DEFAULT,ResCount,ResCountRej)
10/22 10:15:16 MJobPReserve(6636,DEFAULT,ResCount,ResCountRej)
10/22 10:15:16 INFO: total jobs selected in partition ALL: 23/27
[State: 4]
10/22 10:15:16 INFO: total jobs selected in partition DEFAULT: 23/23
10/22 10:15:16 MQueueBackFill(BFQueue,HARD,DEFAULT)
10/22 10:15:16 INFO: total jobs selected in partition ALL: 23/27
[State: 4]
10/22 10:15:16 INFO: current util[1043]: 9/9 (100.00%) PH: 18.51%
active jobs: 9 of 59 (completed: 2561)
10/22 10:15:16 INFO: scheduling complete. sleeping 10 seconds
More information about the torqueusers
mailing list