[torqueusers] node marked falsely down

Julian Hagenauer chaosbringer at gmx.de
Wed Nov 1 05:33:26 MST 2006


Hi,
i have a host running torque server and scheduler and a node running torque mom.
The node is shown as up as it should, but after some time, if the node got offline severall times, the node stays permanently marked as offline by pbsnodes, although tcpdump shows that UDP-packets are send from node to host (size 329) and from host to node (size 26).
What may be wrong?

Can anybody explain, how exactly healthcheck is done? The problem is, that i want the hiost to rekognize if the node is free as soon as possible.

Thank you,
Julian


More information about the torqueusers mailing list