[torqueusers] Torque module for pdsh ?
jbernstein at penguincomputing.com
Thu May 14 14:24:07 MDT 2009
With Scyld ClusterWare we provide server tools that work outside of TORQUE. One
tool is called bpsh, which executes a command sequentially over a set of
nodes:
$ bpsh 0,2-3 hostname
There is also a tool called beorun, which launches the given command in parallel
(not MPI parallel, mind you) across a given set of nodes.
Further, Scyld has a unified process space, so running a ps on the central head
node shows the PIDs running on the compute nodes!
If you are interested in reading more about Scyld ClusterWare you can check here:
Otherwise what would you be looking to add to your existing script?
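
If the goal is simply to hand the job's node list to pdsh, a rough sketch
might look like the following. It assumes that "qstat -f <jobid>" prints the
node list on a single "exec_host = node1/0+node1/1+node2/0" line (long lists
may be wrapped, which this sketch does not handle), and the function names
exec_host_to_list and job_nodes are made up for illustration.

```shell
# Turn an exec_host string such as "node1/0+node1/1+node2/0" into a
# comma-separated list of unique host names, as accepted by "pdsh -w".
exec_host_to_list() {
    printf '%s\n' "$1" | tr '+' '\n' | sed 's,/.*,,' | sort -u | paste -s -d, -
}

# Print the hosts of a TORQUE job (assumes exec_host fits on one line).
job_nodes() {
    hosts=$(qstat -f "$1" | sed -n 's/^ *exec_host = //p')
    exec_host_to_list "$hosts"
}
```

From the administration server this could then be used as, for example,
pdsh -w "$(job_nodes 1234)" ps -ef, where 1234 is a placeholder job ID.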
Ole Holm Nielsen wrote:
> Si Hammond wrote:
>> Have you looked at the pbsdsh command?
> Yes, pbsdsh could be used within a batch job. I'm interested
> in cluster management tools where you can execute a command
> from the central administration server, for example to get
> the process status on the nodes that run a particular job.
> I currently use a simple script that loops over the job's nodes,
> see ftp://ftp.fysik.dtu.dk/pub/Torque/sshjob but I would like
> to improve this by using pdsh.