[torqueusers] Newly added compute nodes get offline state in torque 4.1.4.

Roy Dragseth roy.dragseth at cc.uit.no
Fri Jan 11 02:48:39 MST 2013


When I'm adding new compute nodes into the pool they initially get the offline 
state that needs to be cleared manually.  Is intended behaviour?  It didn't 
use to be that way earlier (aka torque version 2 and 3).


[root at hpc ~]# pbsnodes compute-0-2
compute-0-2
     state = free
     np = 2
     ntype = cluster
     status = 
rectime=1357897229,varattr=,jobs=,state=free,netload=331323,gres=,loadave=0.00,ncpus=2,physmem=2054724kb,availmem=2966412kb,totmem=3078716kb,idletime=528,nusers=0,nsessions=0,uname=Linux 
compute-0-2.local 2.6.32-279.14.1.el6.x86_64 #1 SMP Tue Nov 6 23:43:09 UTC 
2012 x86_64,opsys=linux
     mom_service_port = 15002
     mom_manager_port = 15003
     gpus = 0

[root at hpc ~]# qmgr -c "delete node compute-0-2"
[root at hpc ~]# qmgr -c "create node compute-0-2 np=2,ntype=cluster"
[root at hpc ~]# pbsnodes -l compute-0-2
compute-0-2          offline

If this is intended behaviour I need to add some logic to my torque-roll for 
Rocks to clear the offline state automatically after insertion of new nodes.


r.

-- 

  The Computer Center, University of Tromsø, N-9037 TROMSØ Norway.
	      phone:+47 77 64 41 07, fax:+47 77 64 41 00
        Roy Dragseth, Team Leader, High Performance Computing
	 Direct call: +47 77 64 62 56. email: roy.dragseth at uit.no


More information about the torqueusers mailing list