[torqueusers] big problems with CPU allocation

Garrick Staples garrick at usc.edu
Thu Sep 21 04:14:16 MDT 2006


On Thu, Sep 21, 2006 at 11:53:51AM +0200, bill alleged:
> Hello
> 
> my cluster does weird things.
> 
> 1. It does not allocate enough CPUs:
> qstat -f on a job shows me:
>    exec_host = eclssgh5/3+eclssgh5/2+eclssgh5/1+eclssgh5/0+eclssgh4/3+eclssgh4/2+eclssgh4/1+eclssgh4/0+eclssgh2/3+eclssgh2/2+eclssgh2/1
> So that's 11 CPUs, but:
>    Resource_List.nodes = 3:ppn=4
> 
> This is reproducible. I asked for 8 CPUs:
>     exec_host = eclssgh5/3+eclssgh5/2+eclssgh5/1+eclssgh5/0
>     Resource_List.nodes = 2:ppn=4
> 
> Why does torque not give me the right number of CPUs?
> 
> 2. Strangely, if I switch users, it works with another uid (i.e., it
> gives the right number of CPUs)
> 
> 3. torque gives more jobs to nodes than allowed:
> eclssgh0
>      state = free
>      np = 4
>      ntype = cluster
>      jobs = 0/253.mclssgh1, 0/242.mclssgh1, 1/242.mclssgh1, 1/234.mclssgh1, 2/243.mclssgh1
> 
> 4 CPUs, but 5 jobs?
> 
> Users on this cluster request only nodes and CPUs with
> #PBS -l nodes=x:ppn=y
> 
> What is causing this bug?

This is all fixed in TORQUE 2.1.2.
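
To check whether a job was hit by the mismatch described above, one can
compare the number of slots listed in exec_host against what nodes=x:ppn=y
asks for. The following is only a rough sketch: the helper names
count_slots and requested_slots are made up for illustration, and it
assumes the simple "count:ppn=N" request form used in this thread, not the
full nodes syntax.

```python
def count_slots(exec_host):
    """Count host/slot entries in an exec_host value copied from qstat -f."""
    # Long values wrap in qstat -f output and in mail; drop that whitespace.
    joined = "".join(exec_host.split())
    return len([entry for entry in joined.split("+") if entry])


def requested_slots(nodes_spec):
    """Total slots asked for by a simple spec such as '3:ppn=4'.

    Assumption: each '+'-separated chunk is 'count[:ppn=N]' or
    'hostname[:ppn=N]'; other node properties are ignored.
    """
    total = 0
    for chunk in nodes_spec.split("+"):
        parts = chunk.split(":")
        # A leading number is a node count; a hostname counts as one node.
        count = int(parts[0]) if parts[0].isdigit() else 1
        ppn = 1  # one processor per node when ppn is not given
        for part in parts[1:]:
            if part.startswith("ppn="):
                ppn = int(part[len("ppn="):])
        total += count * ppn
    return total


if __name__ == "__main__":
    # Values taken from the first job in the report above.
    exec_host = """eclssgh5/3+eclssgh5/2+eclssgh5/1+eclssgh5/0+eclssgh4/3+eclssgh4
        /2+eclssgh4/1+eclssgh4/0+eclssgh2/3+eclssgh2/2+eclssgh2/1"""
    print(count_slots(exec_host), "allocated,",
          requested_slots("3:ppn=4"), "requested")
    # prints "11 allocated, 12 requested"
```

For the second job in the report, the same check gives 4 allocated against
8 requested.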

-- 
Garrick Staples, Linux/HPCC Administrator
University of Southern California