[torqueusers] Node Health Check Script
dbeer at adaptivecomputing.com
Wed Apr 24 16:07:37 MDT 2013
Currently, no error is thrown if the script cannot be launched or the pipe
cannot be read for whatever reason.
On Wed, Apr 24, 2013 at 3:28 PM, Stephen Fralich <sjf4 at uw.edu> wrote:
> I ran into a case recently where new processes could not be forked on a
> compute node, but Torque and Moab both reported the node was working
> normally. I have node health check scripts set up and I was surprised that
> this did not cause the node to be set offline. Torque does not check to
> make sure it can successfully run the script?
> torqueusers mailing list
> torqueusers at supercluster.org
David Beer | Senior Software Engineer
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the torqueusers