[torqueusers] Need help with exclusive nodes

Avinash Kewalramani avinash at lanl.gov
Tue Nov 13 15:09:30 MST 2007


Hi all

I have my server_privs/nodes file set up like this

[avinash at sphere ~]$ more /opt/torque/server_priv/nodes
compute-0-1.local np=1
compute-0-2.local np=1
compute-0-3.local np=1
compute-0-4.local np=1
compute-0-5.local np=1

When I submit job all nodes except compute-0-1  accept only a single job.

has anyone seent his before.Any suggestions on how to fix this

pbsnodes o/p is below.compute-0-1 shows two jobs  and it always accepts 
only 2 jobs

IALSO TRIED TO REMOVE COMPUTE-0-1 FROM THE NODE LIST.BUT THEN COMPUTE-0-2
STARTS BEHAVING THIS WAY (i.e accepting 2 jobs).SO SEEMS TO ME THE FIRST 
NODE
ACCEPTS 2 JOBS....WHY??




[avinash at sphere ~]$ pbsnodes -a
compute-0-1.local
    state = job-sharing
    np = 1
    ntype = cluster
    jobs = 0/837.sphere.lanl.gov, 0/836.sphere.lanl.gov
    status = opsys=linux,uname=Linux compute-0-1.local 
2.6.9-55.0.2.ELsmp #1 SMP Tue Jun 26 14:30:58 EDT 2007 
i686,sessions=4846 
4862,nsessions=2,nusers=1,idletime=362625,totmem=5138644kb,availmem=4968680kb,physmem=8312832kb,ncpus=4,loadave=0.08,netload=1080110744,state=free,jobs=836.sphere.lanl.gov 
837.sphere.lanl.gov,rectime=1194986658

compute-0-2.local
    state = job-sharing
    np = 1
    ntype = cluster
    jobs = 0/838.sphere.lanl.gov
    status = opsys=linux,uname=Linux compute-0-2.local 
2.6.9-55.0.2.ELsmp #1 SMP Tue Jun 26 14:30:58 EDT 2007 i686,sessions=? 
0,nsessions=? 
0,nusers=0,idletime=362548,totmem=5138644kb,availmem=4881592kb,physmem=8312832kb,ncpus=4,loadave=0.00,netload=4122744615,state=free,jobs=? 
0,rectime=1194986619

compute-0-3.local
    state = job-sharing
    np = 1
    ntype = cluster
    jobs = 0/839.sphere.lanl.gov
    status = opsys=linux,uname=Linux compute-0-3.local 
2.6.9-55.0.2.ELsmp #1 SMP Tue Jun 26 14:30:58 EDT 2007 i686,sessions=? 
0,nsessions=? 
0,nusers=0,idletime=362531,totmem=5138644kb,availmem=5000836kb,physmem=8312832kb,ncpus=4,loadave=0.00,netload=3726441237,state=free,jobs=? 
0,rectime=1194986625

compute-0-4.local
    state = job-sharing
    np = 1
    ntype = cluster
    jobs = 0/840.sphere.lanl.gov
    status = opsys=linux,uname=Linux compute-0-4.local 
2.6.9-55.0.2.ELsmp #1 SMP Tue Jun 26 14:30:58 EDT 2007 i686,sessions=? 
0,nsessions=? 
0,nusers=0,idletime=726334,totmem=5138644kb,availmem=4999960kb,physmem=8312832kb,ncpus=4,loadave=0.00,netload=2000154370,state=free,jobs=? 
0,rectime=1194986631

compute-0-5.local
    state = job-sharing
    np = 1
    ntype = cluster
    jobs = 0/841.sphere.lanl.gov
    status = opsys=linux,uname=Linux compute-0-5.local 
2.6.9-55.0.2.ELsmp #1 SMP Tue Jun 26 14:30:58 EDT 2007 i686,sessions=? 
0,nsessions=? 
0,nusers=0,idletime=362632,totmem=5138644kb,availmem=4992332kb,physmem=8312832kb,ncpus=4,loadave=0.00,netload=3802968042,state=free,jobs=? 
0,rectime=1194986637

Thanks
Avinash



More information about the torqueusers mailing list