[torquedev] [Bug 65] New: TSK reports 2 when you ask for nodes=1:ppn=1 in a job script vs. command line

Ken Nielson knielson at adaptivecomputing.com
Mon Jun 28 07:10:14 MDT 2010


That is because in the script you requested -l nodes=1:ppn=2, not ppn=1. ppn=2 gives you two tasks.
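
The arithmetic behind the two TSK values in the quoted report can be sketched in bash. This is only an illustration, assuming a simple numeric request of the form N:ppn=M and assuming the task count is N times M; tasks_for_spec is a hypothetical helper, not part of TORQUE, and the server's own accounting may differ for more complex node specifications.

```shell
#!/bin/bash
# Hypothetical helper (not part of TORQUE): estimate the task count for a
# simple "-l nodes=N:ppn=M" request as N * M. A missing ppn is treated as 1.
# Assumes N is a plain number, not a hostname or a '+'-joined list.
tasks_for_spec() {
    local spec="$1"
    local nodes="${spec%%:*}"   # text before the first ':' (the node count)
    local ppn=1
    if [[ "$spec" =~ ppn=([0-9]+) ]]; then
        ppn="${BASH_REMATCH[1]}"
    fi
    echo $(( nodes * ppn ))
}

tasks_for_spec "1:ppn=2"   # the request in script.sh; prints 2
tasks_for_spec "1:ppn=1"   # the request on the command line; prints 1
```

To see which request a script actually carries, grep '^#PBS' script.sh lists its embedded directives.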

Ken Nielson
Adaptive Computing

----- Original Message -----
From: bugzilla-daemon at supercluster.org
To: torquedev at supercluster.org
Sent: Monday, June 28, 2010 7:01:33 AM
Subject: [torquedev] [Bug 65] New: TSK reports 2 when you ask for nodes=1:ppn=1 in a job script vs. command line

http://www.clusterresources.com/bugzilla/show_bug.cgi?id=65

           Summary: TSK reports 2 when you ask for nodes=1:ppn=1 in a job
                    script vs. command line
           Product: TORQUE
           Version: 2.4.x
          Platform: All
        OS/Version: Linux
            Status: NEW
          Severity: major
          Priority: P5
         Component: clients
        AssignedTo: glen.beane at gmail.com
        ReportedBy: chris at adaptivecomputing.com
                CC: torquedev at supercluster.org
   Estimated Hours: 0.0


All,

As you can see in the following output, when you run qsub -l nodes=1:ppn=1 you
get the right number of tasks, but when you create a job script and submit it
you get two tasks instead of one.

Version:   2.4.8

Best Regards,


[adaptcomp at xcathead ~]$ cat script.sh
#!/bin/bash
#PBS -l nodes=1:ppn=2
[adaptcomp at xcathead ~]$ qsub script.sh
12.xcathead
[adaptcomp at xcathead ~]$ echo sleep 333 | qsub -l nodes=1:ppn=1
13.xcathead
[adaptcomp at xcathead ~]$ qstat -a

xcathead:
                                                                         Req'd  Req'd   Elap
Job ID               Username Queue    Jobname          SessID NDS   TSK Memory Time  S Time
-------------------- -------- -------- ---------------- ------ ----- --- ------ ----- - -----
12.xcathead          adaptcom batch    script.sh           --      1   2    --  01:00 Q   --
13.xcathead          adaptcom batch    STDIN               --      1   1    --  01:00 Q   --
[adaptcomp at xcathead ~]$ qstat -f 12
Job Id: 12.xcathead
    Job_Name = script.sh
    Job_Owner = adaptcomp at xcathead
    job_state = Q
    queue = batch
    server = xcathead
    Checkpoint = u
    ctime = Mon Jun 28 13:52:18 2010
    Error_Path = xcathead:/home/adaptcomp/script.sh.e12
    Hold_Types = n
    Join_Path = n
    Keep_Files = n
    Mail_Points = a
    mtime = Mon Jun 28 13:52:18 2010
    Output_Path = xcathead:/home/adaptcomp/script.sh.o12
    Priority = 0
    qtime = Mon Jun 28 13:52:18 2010
    Rerunable = True
    Resource_List.nodect = 1
    Resource_List.nodes = 1:ppn=2
    Resource_List.walltime = 01:00:00
    Variable_List = PBS_O_HOME=/home/adaptcomp,PBS_O_LANG=en_US.UTF-8,
        PBS_O_LOGNAME=adaptcomp,
        PBS_O_PATH=/usr/kerberos/sbin:/usr/kerberos/bin:/usr/local/sbin:/usr/
        local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/opt/xcat/bin:/opt/xcat/sbin:/
        root/bin:/opt/xcat/bin:/opt/xcat/sbin,PBS_O_MAIL=/var/spool/mail/root,
        PBS_O_SHELL=/bin/bash,PBS_SERVER=xcathead,
        PBS_O_WORKDIR=/home/adaptcomp,PBS_O_QUEUE=batch,PBS_O_HOST=xcathead
    comment = cannot create reservation for job '12' (requested resources not
        available at any time)
cannot create reservation for job '12' (request
        ed resources not available at any time)

    etime = Mon Jun 28 13:52:18 2010
    submit_args = script.sh
    Walltime.Remaining = 360
    fault_tolerant = False

[adaptcomp at xcathead ~]$ qstat -f 13
Job Id: 13.xcathead
    Job_Name = STDIN
    Job_Owner = adaptcomp at xcathead
    job_state = Q
    queue = batch
    server = xcathead
    Checkpoint = u
    ctime = Mon Jun 28 13:52:21 2010
    Error_Path = xcathead:/home/adaptcomp/STDIN.e13
    Hold_Types = n
    Join_Path = n
    Keep_Files = n
    Mail_Points = a
    mtime = Mon Jun 28 13:52:22 2010
    Output_Path = xcathead:/home/adaptcomp/STDIN.o13
    Priority = 0
    qtime = Mon Jun 28 13:52:21 2010
    Rerunable = True
    Resource_List.nodect = 1
    Resource_List.nodes = 1:ppn=1
    Resource_List.walltime = 01:00:00
    Variable_List = PBS_O_HOME=/home/adaptcomp,PBS_O_LANG=en_US.UTF-8,
        PBS_O_LOGNAME=adaptcomp,
        PBS_O_PATH=/usr/kerberos/sbin:/usr/kerberos/bin:/usr/local/sbin:/usr/
        local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/opt/xcat/bin:/opt/xcat/sbin:/
        root/bin:/opt/xcat/bin:/opt/xcat/sbin,PBS_O_MAIL=/var/spool/mail/root,
        PBS_O_SHELL=/bin/bash,PBS_SERVER=xcathead,
        PBS_O_WORKDIR=/home/adaptcomp,PBS_O_QUEUE=batch,PBS_O_HOST=xcathead
    comment = cannot create reservation for job '13' (requested resources not
        available at any time)
cannot create reservation for job '13' (request
        ed resources not available at any time)

    etime = Mon Jun 28 13:52:21 2010
    submit_args = -l nodes=1:ppn=1
    Walltime.Remaining = 360
    fault_tolerant = False

[adaptcomp at xcathead ~]$ qsub --about
HomeDir:   /var/spool/torque  InstallDir: /usr/local  Server: xcathead
BuildDir:  /root/torque_moab/torque-2.4.8
BuildUser: root
BuildHost: xcathead
BuildDate: Wed Jun 16 11:54:49 BST 2010
Version:   2.4.8

-- 
Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.
_______________________________________________
torquedev mailing list
torquedev at supercluster.org
http://www.supercluster.org/mailman/listinfo/torquedev

