[torqueusers] Special job for reboot

Arnau Bria arnaubria at pic.es
Mon Feb 1 08:35:58 MST 2010


Sorry, did reply to Chris only.

On Sat, Jan 30, 2010 at 11:22 AM, Chris Samuel <chris at csamuel.org> wrote:

> On Fri, 29 Jan 2010 01:05:27 am Arnau Bria wrote:
>
> > But is someone really doing reboot via torque? What are your steps when
> > you need to reboot your farm?
>
> At VPAC we did this through having a system user who would get a priority
> boost via Moab's config which would submit jobs asking for
> nodes=tango001:ppn=8 (for example) and then doing "sudo reboot" for
> instance.
>
> Usually though we use our health check scripts to spot things like an out
> of
> date kernel on the node and then it marks itself offline for manual
> intervention.
>
> cheers,
> Chris
> --
>  Chris Samuel  :  http://www.csamuel.org/  :  Melbourne, VIC
>
> This email may come with a PGP signature as a file. Do not panic.
> For more info see: http://en.wikipedia.org/wiki/OpenPGP
>
> _______________________________________________
> torqueusers mailing list
> torqueusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/torqueusers
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/torqueusers/attachments/20100201/c79b7870/attachment.html 


More information about the torqueusers mailing list