[torqueusers] problem about running jobs on multiple nodes

Garrick Staples garrick at usc.edu
Mon Nov 12 10:17:56 MST 2007


On Mon, Nov 12, 2007 at 05:38:12PM +0100, Jacques Foury alleged:
> Chien-Pin Chou a écrit :
> >Hello:
> > 
> >I have a problem about running jobs on multiple nodes (n>1)
> > 
> >when I use qsub -l nodes=2:ppn=2 for testing,
> >but it just select 2 cpus in one node instead of choosing 2 cpus per 
> >node, which is total 4 cpus to run
> > 
> >my test script is :
> >#=========================
> >cd $PBS_O_WORKDIR
> >NPROCS=`wc -l < $PBS_NODEFILE`
> >echo $NPROCS
> >cat $PBS_NODEFILE
> >echo "...."
> >/opt/openmpi/bin/mpirun -np $NPROCS -machinefile $PBS_NODEFILE hostname
> 
> This is a VFAQ : Very Frequent Asked Question.
> 
> You must use mpiexec and not mpirun...
> 
> 
> You must read the admin manuals :
> 
> http://clusterresources.com/torquedocs/7.1mpi.shtml

This is only true for MPICH.  OpenMPI already includes TM support.

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
Url : http://www.supercluster.org/pipermail/torqueusers/attachments/20071112/f9e895d0/attachment.bin


More information about the torqueusers mailing list