[torqueusers] Bizarre $PBS_O_WORKDIR bug

Chris Samuel csamuel at vpac.org
Tue Sep 4 01:29:38 MDT 2007


On Tue, 4 Sep 2007, Garrick Staples wrote:

> Which scheduler are you using?  If using moab, kill it, submit the
> "bad" job, look at 'qstat -f $jobid', start the scheduler, and look
> at 'qstat -f $jobid' again and compare the difference.

Bingo - good call Garrick!

Without Moab running, and with Moab started in -P mode (so that it
begins paused), we have this at the end of the Variable_List:

        PBS_O_WORKDIR=/home/csamuel/tmp/tango022,PBS_O_QUEUE=run_1_day

As soon as Moab starts the job, that gets corrupted to:

      PBS_O_WORKDIR=/home/csamuel/tmp/tango022UEUE=run_1_day;NODES=tango022
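
For anyone wanting to repeat the before/after comparison on their own
jobs, here is a minimal sketch in Python of how the two Variable_List
values could be compared. The function names are hypothetical, and it
assumes the Variable_List value has already been copied out of
'qstat -f' as a single comma-separated string with no commas inside
any value. It does not call qstat itself.

```python
def parse_variable_list(value):
    """Split a comma-separated Variable_List string into a dict.

    Assumption: no value contains a literal comma. An entry with
    no '=' is stored with an empty string as its value.
    """
    entries = {}
    for item in value.split(","):
        name, _, val = item.partition("=")  # split on the first '=' only
        entries[name.strip()] = val
    return entries


def diff_variable_lists(before, after):
    """Return {name: (old, new)} for every variable that differs.

    A variable missing on one side is reported with None on that side.
    """
    old = parse_variable_list(before)
    new = parse_variable_list(after)
    changes = {}
    for name in sorted(set(old) | set(new)):
        if old.get(name) != new.get(name):
            changes[name] = (old.get(name), new.get(name))
    return changes


if __name__ == "__main__":
    # The two fragments quoted in this message.
    before = "PBS_O_WORKDIR=/home/csamuel/tmp/tango022,PBS_O_QUEUE=run_1_day"
    after = "PBS_O_WORKDIR=/home/csamuel/tmp/tango022UEUE=run_1_day;NODES=tango022"
    for name, (old, new) in diff_variable_lists(before, after).items():
        print(f"{name}: {old!r} -> {new!r}")
```

Run against the two fragments above, this reports PBS_O_QUEUE as
missing after the change and PBS_O_WORKDIR as having absorbed the
rest of the string, which matches what we see by eye.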

Moab is 5.1.0p4 on AMD64, linked against libtorque as follows:

[root@tango-m ~]# ldd /usr/local/sbin/moab
        libpthread.so.0 => /lib64/libpthread.so.0 (0x0000003a11400000)
        libm.so.6 => /lib64/libm.so.6 (0x0000003a11000000)
        libtorque.so.0 => /usr/local/torque-2.1.8/lib/libtorque.so.0 (0x00002aaaaaabe000)
        libc.so.6 => /lib64/libc.so.6 (0x0000003a10800000)
        /lib64/ld-linux-x86-64.so.2 (0x0000003a10400000)

cheers,
Chris
-- 
Christopher Samuel - (03) 9925 4751 - Systems Manager
 The Victorian Partnership for Advanced Computing
 P.O. Box 201, Carlton South, VIC 3053, Australia
VPAC is a not-for-profit Registered Research Agency