[torqueusers] Special job for reboot
arnaubria at pic.es
Thu Jan 28 07:05:27 MST 2010
this issue is a little OT, but I'd like to know other admin experiences.
Someone already asked this some time ago:
But I don't find the solution he implemented and if it worked or not.
I've seen a couple of good ideas like the one from Brock Palen
recommending a job that requests a complet node and special host (#PBS
-l host=$host,naccesspolicy=SINGLEJOB) and the other from Garrick :
"First, you need to drain the nodes by marking them offline. Then you
need to mark them for reboot using the node note. Then a script can
reboot nodes when it finds them offline, without a job, and marked for
But is someone really doing reboot via torque? What are your steps when
you need to reboot your farm?
Any experience will be welcome!
More information about the torqueusers