[torqueusers] momctl error

Jerry Smith jdsmit at sandia.gov
Mon Mar 5 09:02:03 MST 2007


Michael,

> Hi,
>   I've got torque running on a linux cluster.
> On the master node I run this command 'momctl -d 3'.
> Any ideas on what this means?
> 

-d requests diagnostic messages for a particular mom (compute) node; the number after it sets the level of detail.


See http://www.clusterresources.com/wiki/doku.php?id=torque:commands:momctl


> Also, for the command 'tracejob', how do I tell what the "<JOBID>" is?

JOBID is the number of the job you are interested in.

For example: 1234.<torque-server-name>

On one of our machines, a jobid might be 1234.ladmin2.
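
To make the format concrete, here is a small shell sketch that splits a job ID of that form into its two parts. The variable names are made up for illustration, and the qstat/tracejob lines at the end are left as comments because they assume you are on a host with the Torque client tools and access to the server logs.

```shell
# Example job ID in the <number>.<torque-server-name> form described above.
jobid="1234.ladmin2"

# Everything before the first dot is the job number.
jobnum="${jobid%%.*}"

# Everything after the first dot is the Torque server name.
server="${jobid#*.}"

echo "job number: $jobnum"
echo "server:     $server"

# On a Torque host you would then typically run something like:
#   qstat                # the first column of the listing is the job ID
#   tracejob "$jobnum"   # look the job up in the server logs
```

If you don't know the job ID in the first place, the output of qstat (or the line printed by qsub when the job was submitted) is the usual place to find it.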


Jerry



