[torqueusers] What is cartwire.ncsa.uiuc.edu?
lorenzo118 at interfree.it
Wed Oct 19 02:52:06 MDT 2005
Thank you for your answer, I'll check how mpirun works, but I used mpirun
alone (out of torque) with a simple
mpirun -np 2 helloMPI
and everything works fine.
Anyway thank you very much for the hint!
At 20.59 18/10/2005, you wrote:
>On Tue, Oct 18, 2005 at 10:48:01AM +0200, Lorenzo Campo alleged:
> > Dave,
> > no, I'm using the source of torque 1.2.0p6, downloaded and compiled on my
> > machines following the quickstart guide (and compiling with --with-scp
> > option). I'm using the pbs_sched scheduler, no RPM or other binary at all.
> > There is some issue in running jobs with more than one compute node?
> > Lorenzo
>TORQUE routinely runs jobs with hundreds of nodes probably every second
>of every day.
>Looking at those error messages again, I think this is in your MPI
>stuff, not in TORQUE.
>Which MPI implementation are you using? How are you running helloMPI?
>I'm guessing you might be using MPICH with the p4 device, in which case,
>make sure you run 'mpirun' with some variation of:
> mpirun -machinefile $PBS_NODEFILE -np `wc -l < $PBS_NODEFILE`
>Alternatively, mpiexec from http://www.osc.edu/~pw/mpiexec/ integrates
>very nicely with TORQUE and is widely used and recommended.
>Garrick Staples, Linux/HPCC Administrator
>University of Southern California
>torqueusers mailing list
>torqueusers at supercluster.org
More information about the torqueusers