[torqueusers] Torque/maui node failure policy
wyckoff at yahoo-inc.com
Mon Jun 18 18:28:27 MDT 2007
I want to configure torque in such a way that if any node other than the
node running pbsdsh (the head node?) fails, do __NOTHING__ - don't cancel
the job or re-run it or anything.
My code handles all failures other than the 1st node failing.
Is there a way to configure torque to do nothing other than the head node?
Or do nothing no matter what ? (since head node failures should be rare as
opposed to other nodes).
More information about the torqueusers