[torquedev] [Bug 93] Resource management semantics of Torque need to be well defined

Michel Béland michel.beland at rqchp.qc.ca
Tue Dec 7 11:39:30 MST 2010


Martin Siegert wrote:

> I do like the -lselect syntax in principle as long as it can be
> introduced as an alternative to the -l nodes=x:ppn=y syntax.
> A few comments: I do not mind to use "ncpus", but others might:
> ncpus has a long history and I do not know whether anybody is still
> using it. In my opinion it is mostly broken, thus reusing "ncpus"
> for something else is not a big deal.

We do use -lncpus with Torque on Altix machines, and although it is
broken in some ways, -lnodes is too. I wrote a qsub wrapper that defines
both -lncpus *and* -lnodes on Altix to make sure that everything works
correctly: routing based on processor count, routing based on
properties, cpusets, and displaying the number of tasks with qstat
(although I think that the latter is fixed).
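
The wrapper itself is not reproduced in this message. As a rough
illustration of the idea only, the sketch below builds a qsub command
line that carries both requests. It assumes a single-node Altix where a
job asking for N processors maps to nodes=1:ppn=N; that mapping, the
function name build_qsub_args, and the script name job.sh are all
hypothetical and are not taken from the actual wrapper.

```python
def build_qsub_args(ncpus, script, properties=()):
    """Return a qsub argument list that requests both ncpus and nodes.

    Assumption: the whole job runs on one node, so N processors are
    expressed as nodes=1:ppn=N. Any node properties are appended to the
    nodes specification so that property-based routing still sees them.
    """
    if ncpus < 1:
        raise ValueError("ncpus must be at least 1")
    node_spec = ":".join(["1", f"ppn={ncpus}", *properties])
    return [
        "qsub",
        "-l", f"ncpus={ncpus}",      # seen by routing on processor count
        "-l", f"nodes={node_spec}",  # seen by routing on properties
        script,
    ]


if __name__ == "__main__":
    # Print the command line instead of running it.
    print(" ".join(build_qsub_args(8, "job.sh")))
```

Printing the command rather than executing it keeps the sketch usable on
a machine without Torque installed.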

PBS Pro extended the meaning of ncpus and they define exactly what it 
means to use both -lncpus and -lselect=4:ncpus=8, although I do not 
think it is useful to specify both. Also, with a select request like 
above, it would define an attribute resources_used.ncpus=32 that would 
be used for routing. Very neat. Torque ought to do the same...
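
To make the arithmetic concrete, here is a small sketch of how a total
of 32 falls out of select=4:ncpus=8. It assumes the select syntax is a
list of chunks joined by "+", each written as [count:]res=value[:...],
and that a chunk with no ncpus counts as one CPU. The function name
total_ncpus is made up for illustration; this is not PBS Pro or Torque
code.

```python
def total_ncpus(select_spec):
    """Sum count * ncpus over every '+'-separated chunk of a select spec."""
    total = 0
    for chunk in select_spec.split("+"):
        fields = chunk.split(":")
        # A leading integer is the chunk count; it defaults to 1 if omitted.
        if fields[0].isdigit():
            count = int(fields[0])
            resources = fields[1:]
        else:
            count = 1
            resources = fields
        ncpus = 1  # assumption: a chunk without ncpus counts as one CPU
        for item in resources:
            name, _, value = item.partition("=")
            if name == "ncpus":
                ncpus = int(value)
        total += count * ncpus
    return total


if __name__ == "__main__":
    print(total_ncpus("4:ncpus=8"))  # 4 chunks * 8 CPUs = 32
```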

> I do not like "mpiprocs" and "ompthread": there can be "procs" and
> "threads" other than "mpi" and "omp". We can use "threads" instead
> of "ompthread", but we cannot use "procs" instead of "mpiprocs" -
> that is taken already. Maybe we could use "nprocs" instead?

That is why I liked ppn and cpp: they are much shorter and were not tied
to either MPI or OpenMP.

-- 
Michel Béland, analyste en calcul scientifique
michel.beland at rqchp.qc.ca
bureau S-250, pavillon Roger-Gaudry (principal), Université de Montréal
téléphone : 514 343-6111 poste 3892     télécopieur : 514 343-2155
RQCHP (Réseau québécois de calcul de haute performance)  www.rqchp.ca
Calcul Canada (computecanada.org)

