[torqueusers] Problem with TM interface when using --enable-numa

Lukasz Flis l.flis at cyf-kr.edu.pl
Tue Mar 27 16:59:34 MDT 2012


Hi

It seems that the TM interface in Torque 3.0.4 is broken when Torque is
compiled with the --enable-numa flag.

Example:

qsub -I -l nodes=4:ppn=1
qsub: waiting for job 307.batch-xsmp to start
qsub: job 307.batch-xsmp ready


[@xsmp4-3-1 ~]$ cat $PBS_NODEFILE
xsmp4-3-1.local
xsmp4-2-4.local
xsmp4-2-3.local
xsmp4-1-2.local

#mpiexec from openmpi compiled with TM support
mpiexec uname -n
xsmp4-3-1.local
xsmp4-3-1.local
xsmp4-3-1.local
xsmp4-3-1.local


The job above was allocated 4 different nodes.
However, mpiexec (and likewise pbsdsh) runs the given command 4 times on
the first host listed in $PBS_NODEFILE instead of once on each host.
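
A rough way to check for this from inside a job is to compare the hosts
listed in $PBS_NODEFILE with the hosts that actually report back. The
sketch below is only an illustration: compare_hosts is a hypothetical
helper (not part of Torque), and the temporary file name in the usage
comment is an assumption.

```shell
# compare_hosts ALLOCATED_FILE RAN_FILE
# Hypothetical helper: both files hold one hostname per line.
# Succeeds only if the set of unique hostnames is the same in both.
compare_hosts() {
    allocated=$(sort -u "$1")   # unique hosts the job was given
    actual=$(sort -u "$2")      # unique hosts that ran a task
    if [ "$allocated" = "$actual" ]; then
        echo "OK: tasks ran on all allocated hosts"
    else
        echo "MISMATCH: allocated hosts differ from hosts that ran tasks"
        return 1
    fi
}

# Usage inside an interactive job (file name is an assumption):
#   pbsdsh uname -n > /tmp/ran.$$
#   compare_hosts "$PBS_NODEFILE" /tmp/ran.$$
```

With the output shown above, where all four tasks report
xsmp4-3-1.local, this check would report a mismatch.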

Is this the desired behaviour? I haven't tested Torque 4.0 with NUMA
support, but I suspect it could have the same problem.

Cheers
--
LKF

