[torquedev] [Bug 165] New: qstat -a reports wrong 0 value in TSK column if nodes is a hostname

bugzilla-daemon at supercluster.org bugzilla-daemon at supercluster.org
Sat Dec 10 10:06:01 MST 2011


http://www.clusterresources.com/bugzilla/show_bug.cgi?id=165

           Summary: qstat -a reports wrong 0 value in TSK column if nodes
                    is a hostname
           Product: TORQUE
           Version: 3.0.x
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: minor
          Priority: P5
         Component: clients
        AssignedTo: knielson at adaptivecomputing.com
        ReportedBy: livelfs at free.fr
                CC: torquedev at supercluster.org
   Estimated Hours: 0.0


cat ppn2_TSK0.batch
#!/bin/sh
#PBS -S /bin/sh
#PBS -l nodes=horizon11:ppn=2
#PBS -N Npp
#PBS -j oe
sleep 22m

qsub ppn2_TSK0.batch
246.horizon

horizon: ~/torque_tests > qstat -a

horizon.iap.fr: 
                                                                         Req'd 
Req'd   Elap
Job ID               Username Queue    Jobname          SessID NDS   TSK Memory
Time  S Time
-------------------- -------- -------- ---------------- ------ ----- --- ------
----- - -----
246.horizon          rouberol batch    Npp                4445     1   0    -- 
04:00 R   -- 


TSK value is 0 instead of 2

The problem comes from torque-3.0.3/src/cmds/qstat.c code, line 709 
in 3.0.3 version:

int nodes = atoi(pat->value);

This returns 0 if pat->value does not begin with a number, like "horizon11" in
the job script example above.

The qstat.c code should distinguish between the 2 possibilities indicated 
in http://www.clusterresources.com/torquedocs21/2.1jobsubmission.shtml:
nodes={<node_count> | <hostname>} to get an accurate value of TSK in case
of nodes=<hostname> use.

Regards,
sr

-- 
Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.


More information about the torquedev mailing list