[torqueusers] node marked falsely down
chaosbringer at gmx.de
Wed Nov 1 05:33:26 MST 2006
i have a host running torque server and scheduler and a node running torque mom.
The node is shown as up as it should, but after some time, if the node got offline severall times, the node stays permanently marked as offline by pbsnodes, although tcpdump shows that UDP-packets are send from node to host (size 329) and from host to node (size 26).
What may be wrong?
Can anybody explain, how exactly healthcheck is done? The problem is, that i want the hiost to rekognize if the node is free as soon as possible.
More information about the torqueusers