[torqueusers] Torque and MPI jobs
Trach-Minh Tran
trach-minh.tran at epfl.ch
Sun Nov 12 00:19:19 MST 2006
Hi,
In your example, first start the mpd daemons on the 4 nodes with:
mpdboot -n 4 -f $PBS_NODEFILE
Then, to use all 16 processors on those 4 nodes, run:
mpiexec -n 16 <executable> ...
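If you still want the node file collapsed into host:count lines, a minimal sketch along these lines might do it. The function name collapse_nodefile, the file name mpd.hosts and ./a.out are placeholders, not anything Torque or MPICH2 provides, and the commented-out job steps assume the MPD process manager that ships with MPICH2:

```shell
#!/bin/sh
# Collapse a PBS-style node file (one line per processor) into
# "host:count" lines, one per distinct host, sorted by host name.
collapse_nodefile() {
    # uniq -c prints "<count> <host>"; awk swaps that into "<host>:<count>"
    sort "$1" | uniq -c | awk '{ print $2 ":" $1 }'
}

# Inside a Torque job the pieces could be combined roughly like this
# (left as comments because they need a running cluster):
#   collapse_nodefile "$PBS_NODEFILE" > mpd.hosts
#   mpdboot -n $(wc -l < mpd.hosts) -f mpd.hosts
#   mpiexec -n $(wc -l < "$PBS_NODEFILE") ./a.out
#   mpdallexit
```

Counting lines instead of hard-coding 4 and 16 keeps the script valid if the nodes or ppn request changes.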
Regards, -Minh
On 11/10/2006 02:49 AM, Anna Jonna Armannsdottir wrote:
> Hi
> this is about Torque and MPICH2.
>
> I am writing a pbs job description that
> starts MPI jobs on a number of machines.
>
> The PBS script specifies 4 nodes with
> 4 processors each and generates a
> PBS_NODEFILE that looks like this:
>
> n001
> n001
> n001
> n001
> n002
> n002
> n002
> n002
> n003
> n003
> n003
> n003
> n004
> n004
> n004
> n004
>
> However, mpiexec needs a file like this:
> n001:4
> n002:4
> n003:4
> n004:4
>
> So I wrote a little script that does this.
> So far so good. But when mpdboot starts,
> it ignores one of the nodes, uses the
> master node instead, and refuses to start
> more than 13 processes.
>
> There must be someone that has solved this. :)
>
--
Dr. Trach-Minh Tran
CRPP/SB/EPFL http://crppwww.epfl.ch
Swiss Federal Institute of Technology
CH-1015 Lausanne
Office: http://plan.epfl.ch/?room=PPB111
Tel: +41 21 693 4569
Fax: +41 21 693 5176