[torqueusers] Need help with exclusive nodes
Avinash Kewalramani
avinash at lanl.gov
Tue Nov 13 15:09:30 MST 2007
Hi all,
I have my server_priv/nodes file set up like this:
[avinash@sphere ~]$ more /opt/torque/server_priv/nodes
compute-0-1.local np=1
compute-0-2.local np=1
compute-0-3.local np=1
compute-0-4.local np=1
compute-0-5.local np=1
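To double-check what this file declares, a minimal Python sketch like the one below could read the entries and report the np value for each host. The function name `parse_nodes_file` and the `NODES_TEXT` constant are hypothetical, and the sketch assumes every line has the form `hostname np=N`, with one slot assumed when `np=` is absent.

```python
def parse_nodes_file(text):
    """Return {hostname: np} for nodes-file text (hypothetical helper)."""
    slots = {}
    for line in text.splitlines():
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        np = 1  # assumption: one slot when no np= field is given
        for field in fields[1:]:
            if field.startswith("np="):
                np = int(field[3:])
        slots[fields[0]] = np
    return slots


# The same entries as in the nodes file shown above.
NODES_TEXT = """\
compute-0-1.local np=1
compute-0-2.local np=1
compute-0-3.local np=1
compute-0-4.local np=1
compute-0-5.local np=1
"""

if __name__ == "__main__":
    for host, np in parse_nodes_file(NODES_TEXT).items():
        print(host, np)
```

With the file above, every host comes back with np set to 1, so the file itself gives compute-0-1 no more slots than the other nodes.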
When I submit jobs, every node except compute-0-1 accepts only a single job.
Has anyone seen this before? Any suggestions on how to fix it?
The pbsnodes output is below. compute-0-1 shows two jobs, and it always accepts
only two jobs.
I also tried removing compute-0-1 from the node list, but then compute-0-2
starts behaving this way (i.e. accepting two jobs). So it seems to me that the
first node in the list accepts two jobs. Why?
[avinash@sphere ~]$ pbsnodes -a
compute-0-1.local
state = job-sharing
np = 1
ntype = cluster
jobs = 0/837.sphere.lanl.gov, 0/836.sphere.lanl.gov
status = opsys=linux,uname=Linux compute-0-1.local 2.6.9-55.0.2.ELsmp #1 SMP Tue Jun 26 14:30:58 EDT 2007 i686,sessions=4846 4862,nsessions=2,nusers=1,idletime=362625,totmem=5138644kb,availmem=4968680kb,physmem=8312832kb,ncpus=4,loadave=0.08,netload=1080110744,state=free,jobs=836.sphere.lanl.gov 837.sphere.lanl.gov,rectime=1194986658
compute-0-2.local
state = job-sharing
np = 1
ntype = cluster
jobs = 0/838.sphere.lanl.gov
status = opsys=linux,uname=Linux compute-0-2.local 2.6.9-55.0.2.ELsmp #1 SMP Tue Jun 26 14:30:58 EDT 2007 i686,sessions=? 0,nsessions=? 0,nusers=0,idletime=362548,totmem=5138644kb,availmem=4881592kb,physmem=8312832kb,ncpus=4,loadave=0.00,netload=4122744615,state=free,jobs=? 0,rectime=1194986619
compute-0-3.local
state = job-sharing
np = 1
ntype = cluster
jobs = 0/839.sphere.lanl.gov
status = opsys=linux,uname=Linux compute-0-3.local 2.6.9-55.0.2.ELsmp #1 SMP Tue Jun 26 14:30:58 EDT 2007 i686,sessions=? 0,nsessions=? 0,nusers=0,idletime=362531,totmem=5138644kb,availmem=5000836kb,physmem=8312832kb,ncpus=4,loadave=0.00,netload=3726441237,state=free,jobs=? 0,rectime=1194986625
compute-0-4.local
state = job-sharing
np = 1
ntype = cluster
jobs = 0/840.sphere.lanl.gov
status = opsys=linux,uname=Linux compute-0-4.local 2.6.9-55.0.2.ELsmp #1 SMP Tue Jun 26 14:30:58 EDT 2007 i686,sessions=? 0,nsessions=? 0,nusers=0,idletime=726334,totmem=5138644kb,availmem=4999960kb,physmem=8312832kb,ncpus=4,loadave=0.00,netload=2000154370,state=free,jobs=? 0,rectime=1194986631
compute-0-5.local
state = job-sharing
np = 1
ntype = cluster
jobs = 0/841.sphere.lanl.gov
status = opsys=linux,uname=Linux compute-0-5.local 2.6.9-55.0.2.ELsmp #1 SMP Tue Jun 26 14:30:58 EDT 2007 i686,sessions=? 0,nsessions=? 0,nusers=0,idletime=362632,totmem=5138644kb,availmem=4992332kb,physmem=8312832kb,ncpus=4,loadave=0.00,netload=3802968042,state=free,jobs=? 0,rectime=1194986637
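To spot the mismatch without reading every record by eye, a small Python sketch along these lines could compare the number of entries on each node's `jobs` line against its `np`. The names `jobs_per_node`, `oversubscribed`, and `SAMPLE` are hypothetical. The sketch assumes each record starts with a bare node name followed by `key = value` lines, and the sample keeps only the first two nodes from the output above with the `status` lines left out for brevity.

```python
def jobs_per_node(pbsnodes_text):
    """Map node name -> {"np": int, "jobs": [entries]} from pbsnodes-style text."""
    nodes = {}
    current = None
    for raw in pbsnodes_text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if " = " not in line:
            # Assumption: a line without " = " is the start of a node record.
            current = line
            nodes[current] = {"np": 1, "jobs": []}
            continue
        if current is None:
            continue
        key, value = line.split(" = ", 1)
        if key == "np":
            nodes[current]["np"] = int(value)
        elif key == "jobs":
            nodes[current]["jobs"] = [j.strip() for j in value.split(",") if j.strip()]
    return nodes


def oversubscribed(nodes):
    """Return the names of nodes holding more job entries than np allows."""
    return [name for name, info in nodes.items() if len(info["jobs"]) > info["np"]]


# First two node records from the output above, status lines omitted.
SAMPLE = """\
compute-0-1.local
state = job-sharing
np = 1
ntype = cluster
jobs = 0/837.sphere.lanl.gov, 0/836.sphere.lanl.gov

compute-0-2.local
state = job-sharing
np = 1
ntype = cluster
jobs = 0/838.sphere.lanl.gov
"""

if __name__ == "__main__":
    print(oversubscribed(jobs_per_node(SAMPLE)))
```

On this sample the check flags only compute-0-1.local, which matches what the listing shows: two job entries on a node declared with np = 1.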
Thanks
Avinash