[torqueusers] downing a node via qmgr

Garrick Staples garrick at usc.edu
Thu Sep 22 14:00:58 MDT 2005


On Thu, Sep 22, 2005 at 09:25:03AM +0200, Ronny T. Lampert alleged:
> > I would strongly suggest that you do not start the pbs_mom automatically on a 
> > reboot via init scripts.
> > 
> > If you've rebooted the node yourself then you should restart it by hand 
> > whereas if the node dies and reboots you're probably going to want to 
> > investigate.  We do this, and only restart the mom when we've got a better 
> > handle on things and think it safe to do so.
> 
> ... so if I want to disable a node, I simply delete it in qmgr.
> That way the mom can come up again, but jobs won't be scheduled.
> 
> However, you will lose your settings (of course), so be careful with deletion.
> As I have none, I am the lucky guy :)

That is the whole point of OFFLINE.  Just 'pbsnodes -o hostname' to
prevent further scheduling of new jobs.  Then 'pbsnodes -c hostname'
when it is ready.

-- 
Garrick Staples, Linux/HPCC Administrator
University of Southern California
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
Url : http://www.supercluster.org/pipermail/torqueusers/attachments/20050922/18db9f50/attachment.bin


More information about the torqueusers mailing list