[Mauiusers] Bug in Torque 2.3.7 with Maui 3.2.6p21: Truncated node resources

Michel Béland michel.beland at rqchp.qc.ca
Thu Jun 4 07:57:21 MDT 2009


Ole Holm Nielsen wrote:
> We're upgrading our cluster from CentOS4 to CentOS5 and would like to
> upgrade Torque/Maui as well.  We're having a problem with Torque 2.3.7
> and Maui 3.2.6p21 that's a real show-stopper: Node resources from
> Torque get truncated by Maui so that only the first resource is used.
> Obviously, this makes our test cluster rather useless at this time.
> We do not know whether the bug is a Torque or a Maui issue.
> 
> Some examples from the Maui logfile illustrate the problem when
> we submit jobs that request 2 nodes with ppn=4 on each node.
> 
> 1) Job submitted with "qsub -l nodes=2:ppn=4" is allocated only 1 node:
> 06/04 14:18:39 MRMJobStart(37,Msg,SC)
> 06/04 14:18:39 MPBSJobStart(37,0,Msg,SC)
> 06/04 14:18:39 MPBSJobModify(37,Resource_List,Resource,m038:ppn=4)
> 06/04 14:18:39 MPBSJobModify(37,Resource_List,Resource,2:ppn=4)
> 
> 2) Job submitted with "qsub -l nodes=m040:ppn=4+m035:ppn=4" is
> allocated only node m040:
> 06/04 14:21:47 MRMJobStart(41,Msg,SC)
> 06/04 14:21:47 MPBSJobStart(41,0,Msg,SC)
> 06/04 14:21:47 MPBSJobModify(41,Resource_List,Resource,m035:ppn=4)
> 06/04 14:21:47 MPBSJobModify(41,Resource_List,Resource,m040:ppn=4+m035:ppn=4)

Does it help if you add the following to your maui.cfg file?

ENABLEMULTIREQJOBS   TRUE

Do not forget to restart Maui after this change.

-- 
Michel Béland, analyste en calcul scientifique
michel.beland at rqchp.qc.ca
bureau S-250, pavillon Roger-Gaudry (principal), Université de Montréal
téléphone   : 514 343-6111 poste 3892     télécopieur : 514 343-2155
RQCHP (Réseau québécois de calcul de haute performance)  www.rqchp.qc.ca


More information about the mauiusers mailing list