[torqueusers] big problems with CPU allocation
bill
cluster.bill at alinto.com
Thu Sep 21 03:53:51 MDT 2006
Hello
My cluster is behaving strangely.
1. It does not allocate enough CPUs:
qstat -f on a job shows me:
exec_host =
eclssgh5/3+eclssgh5/2+eclssgh5/1+eclssgh5/0+eclssgh4/3+eclssgh4
/2+eclssgh4/1+eclssgh4/0+eclssgh2/3+eclssgh2/2+eclssgh2/1
So that is 11 CPUs, but 12 were requested:
Resource_List.nodes = 3:ppn=4
This is reproducible. I asked for 8 CPUs but got only 4:
exec_host = eclssgh5/3+eclssgh5/2+eclssgh5/1+eclssgh5/0
Resource_List.nodes = 2:ppn=4
Why does Torque not give me the right number of CPUs?
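To make the mismatch easy to check, here is a minimal Python sketch that counts the host/slot entries in an exec_host value and compares the count with the slots implied by a simple nodes=N:ppn=M request. The function names slots_granted and slots_requested are hypothetical helpers written for this illustration, not part of Torque, and the parser only handles the plain N:ppn=M form used in this post.

```python
import re


def slots_granted(exec_host: str) -> int:
    """Count host/slot entries in an exec_host value.

    Whitespace introduced by qstat line wrapping is removed first.
    """
    cleaned = "".join(exec_host.split())
    return len([entry for entry in cleaned.split("+") if entry])


def slots_requested(nodes_spec: str) -> int:
    """Return N * M for a simple 'N:ppn=M' request (other forms are rejected)."""
    match = re.fullmatch(r"(\d+):ppn=(\d+)", nodes_spec.strip())
    if match is None:
        raise ValueError(f"unsupported nodes spec: {nodes_spec!r}")
    return int(match.group(1)) * int(match.group(2))


# Values copied from the qstat -f output quoted above.
exec_host = """eclssgh5/3+eclssgh5/2+eclssgh5/1+eclssgh5/0+eclssgh4/3+eclssgh4
/2+eclssgh4/1+eclssgh4/0+eclssgh2/3+eclssgh2/2+eclssgh2/1"""

granted = slots_granted(exec_host)
requested = slots_requested("3:ppn=4")
print(f"granted={granted} requested={requested}")
```

For the first job this reports 11 granted against 12 requested; for the second job (nodes=2:ppn=4) the same check gives 4 against 8.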
2. Strangely, when I switch to another user (a different uid), it works
(i.e., it gives the right number of CPUs).
3. Torque assigns more jobs to a node than allowed:
eclssgh0
     state = free
     np = 4
     ntype = cluster
     jobs = 0/253.mclssgh1, 0/242.mclssgh1, 1/242.mclssgh1,
            1/234.mclssgh1, 2/243.mclssgh1
The node has 4 CPUs, but 5 job entries are listed, and slots 0 and 1 each appear twice?
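The double booking is easier to see when the jobs field is grouped by slot. The following is a small Python sketch, assuming the field is a comma-separated list of slot/jobid entries as in the pbsnodes output quoted above; jobs_per_slot is a hypothetical helper written for this illustration, not a Torque tool.

```python
from collections import defaultdict


def jobs_per_slot(jobs_field: str) -> dict:
    """Group job ids by slot index from a 'slot/jobid, slot/jobid, ...' string."""
    slots = defaultdict(list)
    for entry in jobs_field.split(","):
        entry = entry.strip()
        if not entry:
            continue
        slot, job_id = entry.split("/", 1)
        slots[int(slot)].append(job_id)
    return dict(slots)


# Value copied from the pbsnodes output quoted above.
jobs = ("0/253.mclssgh1, 0/242.mclssgh1, 1/242.mclssgh1, "
        "1/234.mclssgh1, 2/243.mclssgh1")

usage = jobs_per_slot(jobs)
# Keep only the slots that carry more than one job entry.
shared = {slot: ids for slot, ids in usage.items() if len(ids) > 1}
print(shared)
```

On this node the check flags slots 0 and 1 as shared by two jobs each, while slot 3 has no job at all.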
Users on this cluster request only nodes and CPUs, with:
#PBS -l nodes=x:ppn=y
What is causing this bug?
Thanks for any help