[torqueusers] torque not listening to ppn request specs

Smith, Jerry Don II jdsmit at sandia.gov
Thu Oct 27 10:18:13 MDT 2011


Looking at your accounting logs, are you getting 256 nodes total, and only
executing on the 64?  Or are you getting just 64 nodes allocated.

If getting 256 but only running on 64, you may need to add "-np 1" to your
mpirun line

Ie: mpirun -np 1 /path/to/ececutable

Jerry



On 10/27/11 10:12 AM, "DuChene, StevenX A" <stevenx.a.duchene at intel.com>
wrote:

>Jon:
>
>Thanks for the suggestions...
>
>OK, I did:
>
>$ qsub -l nodes=256:SeaMicro:ppn=1 -l tpn=1 script_noarch.pbs
>55.emcutil1.end.atom
>
>So it took the job that time but I still got packed processes instead of
>one per node. Only 64 unique nodes used with four processes per node
>instead of 256 unique nodes.
>
>GAH!
>--
>Steven DuChene
>
>
>
>-----Original Message-----
>From: Jon Wakelin [mailto:jw292 at leicester.ac.uk]
>Sent: Thursday, October 27, 2011 9:08 AM
>To: DuChene, StevenX A
>Cc: Jon Wakelin; Torque Users Mailing List
>Subject: Re: [torqueusers] torque not listening to ppn request specs
>
>On 27/10/11 17:00, DuChene, StevenX A wrote:
>> Nope; I just tried that too and got the same result. Packed processes
>>one per processor instead of one per node. If I omit the ppn=1 option
>>and just use tpn=1 I get the following:
>>
>> $ qsub -l nodes=256:SeaMicro:tpn=1 script_noarch.pbs
>> qsub: Job exceeds queue resource limits MSG=cannot locate feasible nodes
>
>
>No, the tpn is a separate resource request.  So you need to separate it
>with a comma not a semicolon
>
>    qsub -l nodes=256,tpn=1
>
>or alternatively specify -l twice
>
>    qsub -l nodes=256 -l tpn=1
>
>
>give it another go.  We see the same behaviour as you (packed as tightly
>as possible) if we only specify node=x:ppn=y .
>
>does that help?
>
>Jon
>
>>
>> -----Original Message-----
>> From: Jon Wakelin [mailto:jw292 at leicester.ac.uk]
>> Sent: Thursday, October 27, 2011 8:52 AM
>> To: DuChene, StevenX A
>> Subject: Re: [torqueusers] torque not listening to ppn request specs
>>
>> On 26/10/11 01:10, DuChene, StevenX A wrote:
>>> Hello all:
>>>
>>> I have torque 2.5.7 and maui 3.2.6p21 installed on a couple of small
>>>clusters
>>> and I am submitting the following mpi job using:
>>>
>>> qsub -l nodes=12:mynode:ppn=1 script_noarch.pbs
>>
>> I think you want tpn in addition to ppn.  Where tpn = tasks per node.
>>Note the comma not semicolon.
>>
>>
>>      qsub -l nodes=12:ppn=1,tpn=1 script_noarch.pbs
>>
>>
>> Does that give you what you want?
>>
>> It's best to think of nodes=12:ppn=1 as requesting 12 lots of 1 core, a
>>request that could be satisfied by 12 single
>> core machines or a single 12-core machine, or 2 six-core machines, and
>>so on.   It doesn't specifically request 12
>> physically separate nodes (servers).
>>
>>
>> At least, that's what we use here.
>>
>> regards
>> Jon
>
>
>-- 
>Dr Jon Wakelin
>Systems Architect	
>IT Services, University Of Leicester, LE1 7RH
>Tel.  +44 (0)116 252 5147
>_______________________________________________
>torqueusers mailing list
>torqueusers at supercluster.org
>http://www.supercluster.org/mailman/listinfo/torqueusers
>



More information about the torqueusers mailing list