[torqueusers] Torque and MPI jobs

Trach-Minh Tran trach-minh.tran at epfl.ch
Sun Nov 12 00:19:19 MST 2006


Hi,

In your example, you should first start the mpd on the 4 nodes
using:

      mpdboot -n 4 -f $PBS_NODEFILE

After that, to use all the 16 processors on your 4 nodes, just 
run:

      mpiexec -n 16 <executable> ...

Regards,   -Minh

On 11/10/2006 02:49 AM, Anna Jonna Armannsdottir wrote:
> Hi 
> this is about Torque and MPICH2. 
> 
> I am writing a pbs job description that 
> starts MPI jobs on a number of machines. 
> 
> The pbs script specifies 4 nodes with 
> 4 processors each and generates a 
> PBS_NODEFILE that looks like this
> 
> n001
> n001
> n001
> n001
> n002
> n002
> n002
> n002
> n003
> n003
> n003
> n003
> n004
> n004
> n004
> n004
> 
> However, the mpiexec needs a file like this 
> n001:4
> n002:4
> n003:4
> n004:4
> 
> So I wrote a little script that does this. 
> So far so good. When the mpdboot starts, 
> it ignores one of the nodes and uses the 
> masternode instead and refuses to start 
> more than 13 processes. 
> 
> There must be someone that has solved this. :) 
> 


-- 
Dr. Trach-Minh Tran
CRPP/SB/EPFL http://crppwww.epfl.ch
Swiss Federal Institute of Technology
CH-1015 Lausanne
Office: http://plan.epfl.ch/?room=PPB111
Tel: +41 21 693 4569
Fax: +41 21 693 5176


More information about the torqueusers mailing list