[torqueusers] reading the "Time Use" in qstat
David Golden
dgolden at cp.dias.ie
Wed Sep 13 05:36:01 MDT 2006
On Wednesday 13 September 2006 10:47, bill wrote:
> the CPU consumption is stable, around 90% all the time. So why doesn't
> cput grow? The job is running on two nodes.
>
> Thanks for any help
Okay, another possibility (really a FAQ): if you're running a parallel job,
which launcher are you using? Your parallel processes need to be started
by a program that speaks the PBS TM (task management) API. That includes
(non-exhaustive list, probably): pbsdsh, included with torque; OSC mpiexec,
for use with mpich and derivatives (n.b. usable for non-MPI parallel
launching too); openrte/mpiexec from Open MPI; LAM mpiexec.
But NOT: mpirun from mpich and derivatives, mpiexec from mpich2 and
derivatives, or plain ssh. These effectively bypass the batch system, so
the batch system cannot track the CPU usage of the processes they start,
which is why cput can stay low even while the nodes are busy.
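If you want a quick check for this symptom, here is a rough sketch that compares cput against walltime in the text printed by "qstat -f <jobid>". It assumes the usual "resources_used.cput" and "resources_used.walltime" lines in HH:MM:SS form. The function names and the sample job text are made up for illustration and are not part of torque.

```python
import re


def hms_to_seconds(hms: str) -> int:
    """Convert an HH:MM:SS string (hours may exceed 24) to seconds."""
    hours, minutes, seconds = (int(part) for part in hms.strip().split(":"))
    return hours * 3600 + minutes * 60 + seconds


def used_resources(qstat_text: str) -> dict:
    """Collect 'resources_used.<name> = <value>' pairs from qstat -f text."""
    pattern = re.compile(
        r"^[ \t]*resources_used\.(\w+)[ \t]*=[ \t]*(\S+)[ \t]*$",
        re.MULTILINE,
    )
    return dict(pattern.findall(qstat_text))


def cpu_to_wall_ratio(qstat_text: str):
    """Return cput / walltime, or None if either value is missing or zero."""
    used = used_resources(qstat_text)
    if "cput" not in used or "walltime" not in used:
        return None
    wall = hms_to_seconds(used["walltime"])
    if wall == 0:
        return None
    return hms_to_seconds(used["cput"]) / wall


# Hypothetical qstat -f excerpt (invented values, not real output).
SAMPLE = """\
Job Id: 1234.example
    Job_Name = demo
    resources_used.cput = 00:00:12
    resources_used.mem = 3540kb
    resources_used.walltime = 02:00:00
"""

if __name__ == "__main__":
    ratio = cpu_to_wall_ratio(SAMPLE)
    print(f"cput/walltime = {ratio:.4f}")
```

A ratio near zero on a job that top shows as busy suggests the processes were started outside TM. A job launched through a TM-aware launcher would be expected to show a ratio that grows towards the number of busy processors.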
You might want to scan through the wiki for torque, e.g.:
http://www.clusterresources.com/wiki/doku.php?id=torque:7.1_mpi_support