[torqueusers] torque not listening to ppn request specs
DuChene, StevenX A
stevenx.a.duchene at intel.com
Thu Oct 27 10:12:35 MDT 2011
Thanks for the suggestions...
OK, I did:
$ qsub -l nodes=256:SeaMicro:ppn=1 -l tpn=1 script_noarch.pbs
So it took the job that time but I still got packed processes instead of one per node. Only 64 unique nodes used with four processes per node instead of 256 unique nodes.
From: Jon Wakelin [mailto:jw292 at leicester.ac.uk]
Sent: Thursday, October 27, 2011 9:08 AM
To: DuChene, StevenX A
Cc: Jon Wakelin; Torque Users Mailing List
Subject: Re: [torqueusers] torque not listening to ppn request specs
On 27/10/11 17:00, DuChene, StevenX A wrote:
> Nope; I just tried that too and got the same result. Packed processes one per processor instead of one per node. If I omit the ppn=1 option and just use tpn=1 I get the following:
> $ qsub -l nodes=256:SeaMicro:tpn=1 script_noarch.pbs
> qsub: Job exceeds queue resource limits MSG=cannot locate feasible nodes
No, the tpn is a separate resource request. So you need to separate it with a comma not a semicolon
qsub -l nodes=256,tpn=1
or alternatively specify -l twice
qsub -l nodes=256 -l tpn=1
give it another go. We see the same behaviour as you (packed as tightly as possible) if we only specify node=x:ppn=y .
does that help?
> -----Original Message-----
> From: Jon Wakelin [mailto:jw292 at leicester.ac.uk]
> Sent: Thursday, October 27, 2011 8:52 AM
> To: DuChene, StevenX A
> Subject: Re: [torqueusers] torque not listening to ppn request specs
> On 26/10/11 01:10, DuChene, StevenX A wrote:
>> Hello all:
>> I have torque 2.5.7 and maui 3.2.6p21 installed on a couple of small clusters
>> and I am submitting the following mpi job using:
>> qsub -l nodes=12:mynode:ppn=1 script_noarch.pbs
> I think you want tpn in addition to ppn. Where tpn = tasks per node. Note the comma not semicolon.
> qsub -l nodes=12:ppn=1,tpn=1 script_noarch.pbs
> Does that give you what you want?
> It's best to think of nodes=12:ppn=1 as requesting 12 lots of 1 core, a request that could be satisfied by 12 single
> core machines or a single 12-core machine, or 2 six-core machines, and so on. It doesn't specifically request 12
> physically separate nodes (servers).
> At least, that's what we use here.
Dr Jon Wakelin
IT Services, University Of Leicester, LE1 7RH
Tel. +44 (0)116 252 5147
More information about the torqueusers