[Mauiusers] Node idle but load is HIGH
Jan Ploski
Jan.Ploski at offis.de
Fri Sep 28 01:31:01 MDT 2007
Hello,
diagnose -n on my system gives the following message for quite a few
nodes:
WARNING: node 'node38' has been idle for 8:25:00 but load is HIGH. load:
3.020 (check for runaway processes?)
However, the node is running three jobs:
node38 Idle 4:4 7988:7988 1:1 15314:15314
1.00 linux [NONE] DEF 3.00 003 [dgiseq_4:4][verylong_4:4][sma [DEFAULT]
[dual][eth]
node38
state = free
np = 4
properties = eth,dual
ntype = cluster
jobs = 0/346565.srvgrid01.offis.uni-oldenburg.de,
1/346438.srvgrid01.offis.uni-oldenburg.de,
2/346540.srvgrid01.offis.uni-oldenburg.de
status = opsys=linux,uname=Linux node38 2.6.16.27-0.9-smp #1 SMP Tue
Feb 13 09:35:18 UTC 2007 x86_64,sessions=23026 23588
26819,nsessions=3,nusers=1,idletime=163,totmem=16003996kb,availmem=15682840kb,physmem=8180384kb,ncpus=4,loadave=3.00,netload=75435158326,state=free,jobs=346438.srvgrid01.offis.uni-oldenburg.de
346540.srvgrid01.offis.uni-oldenburg.de
346565.srvgrid01.offis.uni-oldenburg.de,rectime=1190964204
...and according to pstree these jobs are child processes of pbs_mom, so
definitely not "runaway".
How can it happen that the class initiators on a node are 4:4 even though
this node is running some jobs?
Best regards,
Jan Ploski
More information about the mauiusers
mailing list