[torqueusers] PBS LAM_MPI error

Onur Destanoğlu odestanoglu at gmail.com
Mon Aug 22 07:08:24 MDT 2005


Hi all, I have finally found the correct error report (which seems ironic :)) )

Error report file:
[root@bee01 ~]# cat /var/spool/torque/undelivered/31.bee00.be.ER
-----------------------------------------------------------------------------
It seems that there is no lamd running on the host bee01.

This indicates that the LAM/MPI runtime environment is not operating.
The LAM/MPI runtime environment is necessary for the "mpirun" command.

Please run the "lamboot" command the start the LAM/MPI runtime
environment.  See the LAM/MPI documentation for how to invoke
"lamboot" across multiple machines.
-----------------------------------------------------------------------------

LAM/MPI is running on the master node (bee00), and it accepts the nodes
file that lists the cluster nodes.
Do I have to run LAM/MPI on the cluster nodes as well? That would mean
creating the same accounts on the cluster nodes and running
"lamboot -v hosts" on each of them. This sounds ridiculous.

