[torqueusers] why a mpirun job only runs on a single node

xuehai zhang hai at cs.uchicago.edu
Tue Mar 1 10:38:48 MST 2005


Tim,

Thanks for the suggestions.

>>-----------------begin of the PBS script--------------------------------------
>>
>>#PBS -l nodes=3:ppn=1
>>#PBS -l walltime=48:00:00
>>#PBS -q qsar
>>#PBS -j oe
>>#PBS -N myjob2
>>
>>cd /usr/local/exports
>>
>>echo " "
>>echo " "
>>echo "Job started on `hostname` at `date`"
>>sleep 2
>>/usr/bin/mpirun -np 3  /var/tmp/MPI_Tutorial/HelloWorld/helloWorld > myjob2_$HOSTNAME.out
>>#/usr/bin/mpirun -machinefile ${PBS_NODEFILE} -np 3 /home/globus/MPI_Tutorial/HelloWorld/helloWorld
>> > myjob2_$HOSTNAME.out
>>#/usr/bin/mpirun /home/globus/MPI_Tutorial/HelloWorld/helloWorld
>>echo " "
>>echo "Job Ended at `date`"
>>echo " "
> 
> 
> It looks like in the one mpirun command that isn't commented out you don't
> specify a machinefile. This should be set to $PBS_NODEFILE, as it is in
> the commented out mpirun.

I see. Must I provide "-np" option for mpirun each time?

> Also, I'm not sure why you're redirecting output
> to the myjob2_$HOSTNAME file ... AFAIK you won't get one output file per
> host. You may want to use the -o and -e PBS options in your script to
> control where output goes.

I will check PBS docs for using "-o" and "-e" options.

Best,

Xuehai


More information about the torqueusers mailing list