[torqueusers] Torque and MPI jobs

Anna Jonna Armannsdottir annaj at hi.is
Thu Nov 9 18:49:03 MST 2006


Hi 
this is about Torque and MPICH2. 

I am writing a pbs job description that 
starts MPI jobs on a number of machines. 

The pbs script specifies 4 nodes with 
4 processors each and generates a 
PBS_NODEFILE that looks like this

n001
n001
n001
n001
n002
n002
n002
n002
n003
n003
n003
n003
n004
n004
n004
n004

However, the mpiexec needs a file like this 
n001:4
n002:4
n003:4
n004:4

So I wrote a little script that does this. 
So far so good. When the mpdboot starts, 
it ignores one of the nodes and uses the 
masternode instead and refuses to start 
more than 13 processes. 

There must be someone that has solved this. :) 

-- 
Kindest Regards, Anna Jonna Ármannsdóttir,
Unix System Aministration, Computing Services, 
University of Iceland.



More information about the torqueusers mailing list