[torqueusers] torque not listening to ppn request specs

DuChene, StevenX A stevenx.a.duchene at intel.com
Thu Oct 27 10:12:35 MDT 2011


Jon:

Thanks for the suggestions...

OK, I did:

$ qsub -l nodes=256:SeaMicro:ppn=1 -l tpn=1 script_noarch.pbs
55.emcutil1.end.atom

So it took the job that time but I still got packed processes instead of one per node. Only 64 unique nodes used with four processes per node instead of 256 unique nodes.

GAH!
--
Steven DuChene



-----Original Message-----
From: Jon Wakelin [mailto:jw292 at leicester.ac.uk] 
Sent: Thursday, October 27, 2011 9:08 AM
To: DuChene, StevenX A
Cc: Jon Wakelin; Torque Users Mailing List
Subject: Re: [torqueusers] torque not listening to ppn request specs

On 27/10/11 17:00, DuChene, StevenX A wrote:
> Nope; I just tried that too and got the same result. Packed processes one per processor instead of one per node. If I omit the ppn=1 option and just use tpn=1 I get the following:
>
> $ qsub -l nodes=256:SeaMicro:tpn=1 script_noarch.pbs
> qsub: Job exceeds queue resource limits MSG=cannot locate feasible nodes


No, the tpn is a separate resource request.  So you need to separate it with a comma not a semicolon

    qsub -l nodes=256,tpn=1

or alternatively specify -l twice

    qsub -l nodes=256 -l tpn=1


give it another go.  We see the same behaviour as you (packed as tightly as possible) if we only specify node=x:ppn=y .

does that help?

Jon

>
> -----Original Message-----
> From: Jon Wakelin [mailto:jw292 at leicester.ac.uk]
> Sent: Thursday, October 27, 2011 8:52 AM
> To: DuChene, StevenX A
> Subject: Re: [torqueusers] torque not listening to ppn request specs
>
> On 26/10/11 01:10, DuChene, StevenX A wrote:
>> Hello all:
>>
>> I have torque 2.5.7 and maui 3.2.6p21 installed on a couple of small clusters
>> and I am submitting the following mpi job using:
>>
>> qsub -l nodes=12:mynode:ppn=1 script_noarch.pbs
>
> I think you want tpn in addition to ppn.  Where tpn = tasks per node.   Note the comma not semicolon.
>
>
>      qsub -l nodes=12:ppn=1,tpn=1 script_noarch.pbs
>
>
> Does that give you what you want?
>
> It's best to think of nodes=12:ppn=1 as requesting 12 lots of 1 core, a request that could be satisfied by 12 single
> core machines or a single 12-core machine, or 2 six-core machines, and so on.   It doesn't specifically request 12
> physically separate nodes (servers).
>
>
> At least, that's what we use here.
>
> regards
> Jon


-- 
Dr Jon Wakelin
Systems Architect	
IT Services, University Of Leicester, LE1 7RH
Tel.  +44 (0)116 252 5147


More information about the torqueusers mailing list