[torquedev] Problem with TM interface when using --enable-numa
Lukasz Flis
l.flis at cyf-kr.edu.pl
Tue Mar 27 16:59:34 MDT 2012
Hi
It seems that the TM interface in Torque 3.0.4 is broken when Torque is
compiled with the --enable-numa flag.
Example:
qsub -I -l nodes=4:ppn=1
qsub: waiting for job 307.batch-xsmp to start
qsub: job 307.batch-xsmp ready
[@xsmp4-3-1 ~]$ cat $PBS_NODEFILE
xsmp4-3-1.local
xsmp4-2-4.local
xsmp4-2-3.local
xsmp4-1-2.local
# mpiexec from Open MPI compiled with TM support
mpiexec uname -n
xsmp4-3-1.local
xsmp4-3-1.local
xsmp4-3-1.local
xsmp4-3-1.local
The job above was allocated 4 different nodes.
However, mpiexec (and likewise pbsdsh) runs the given command 4 times on
the first host listed in $PBS_NODEFILE instead of once on each node.
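For anyone who wants to check their own build, here is a rough sketch of
the comparison I did by eye above: it counts the hosts listed in
$PBS_NODEFILE and the hosts reported by "pbsdsh uname -n", then prints the
difference. The helper name compare_hosts is made up for this example, and
the script assumes that uname -n prints hostnames in the same form as the
node file (as it does on our cluster); it is not part of Torque.

```python
import os
import subprocess
from collections import Counter


def compare_hosts(expected, observed):
    """Return (missing, extra) Counters of hostnames.

    missing: slots allocated in the node file but not seen in the output
    extra:   hosts seen in the output more often than they were allocated
    """
    want = Counter(h.strip() for h in expected if h.strip())
    got = Counter(h.strip() for h in observed if h.strip())
    # Counter subtraction keeps only positive counts.
    return want - got, got - want


if __name__ == "__main__":
    # Only meaningful inside a running Torque job, where PBS_NODEFILE is set.
    with open(os.environ["PBS_NODEFILE"]) as fh:
        expected = fh.read().splitlines()
    # pbsdsh launches the command through TM on the allocated slots.
    observed = subprocess.run(
        ["pbsdsh", "uname", "-n"],
        capture_output=True, text=True, check=True,
    ).stdout.splitlines()
    missing, extra = compare_hosts(expected, observed)
    print("allocated but never used:", dict(missing))
    print("used more often than allocated:", dict(extra))
```

With the job shown above, this would report the three other nodes as never
used and xsmp4-3-1.local as used three times more than allocated.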
Is this the intended behaviour? I haven't tested Torque 4.0 with NUMA
support, but I suspect it could have the same problem.
Cheers
--
LKF