[torqueusers] pbs_mom endless kill loop

Kevin Murphy murphy at genome.chop.edu
Mon Oct 20 19:45:28 MDT 2008

Torque 2.3.0 on CentOS 5 (Rocks V).

For reasons unknown, our moms occasionally explode in an orgy of 
logging, repeatedly writing messages like this:

10/20/2008 12:15:47;0080; pbs_mom;Svr;preobit_reply;top of preobit_reply
10/20/2008 12:15:47;0080; pbs_mom;Svr;preobit_reply;DIS_reply_read/decode_DIS_replySvr worked, top of while loop
10/20/2008 12:15:47;0080; pbs_mom;Svr;preobit_reply;in while loop, no error from job stat
10/20/2008 12:15:47;0001; pbs_mom;Job;19139.variome.chop.edu;scan_for_exiting: sending signal 9, "KILL" to job 19140.variome.chop.edu, reason: local task termination

These messages are endlessly repeated.  Thoughts?
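
To get a feel for how often each message recurs, a rough Python sketch like the one below could tally records from a mom log. It is not part of Torque. It only assumes that each record is a single line with six semicolon-separated fields, as in the excerpt above. The field names (stamp, code, daemon, class, object, message) and the function names are labels invented for this illustration.

```python
from collections import Counter

# Sample records copied from the excerpt above, one record per line.
SAMPLE = [
    "10/20/2008 12:15:47;0080; pbs_mom;Svr;preobit_reply;top of preobit_reply",
    "10/20/2008 12:15:47;0080; pbs_mom;Svr;preobit_reply;in while loop, no error from job stat",
    '10/20/2008 12:15:47;0001; pbs_mom;Job;19139.variome.chop.edu;scan_for_exiting: '
    'sending signal 9, "KILL" to job 19140.variome.chop.edu, reason: local task termination',
]

# Hypothetical field labels; the log itself does not name its columns.
FIELDS = ("stamp", "code", "daemon", "class", "object", "message")


def parse_mom_line(line):
    """Split one log record into six fields, or return None if it does not fit."""
    # maxsplit=5 keeps any semicolons inside the message text intact.
    parts = line.rstrip("\n").split(";", 5)
    if len(parts) != len(FIELDS):
        return None
    return dict(zip(FIELDS, (p.strip() for p in parts)))


def count_repeats(lines):
    """Count how many times each (object, message) pair appears."""
    counts = Counter()
    for line in lines:
        record = parse_mom_line(line)
        if record is None:
            continue  # skip wrapped or malformed lines
        counts[(record["object"], record["message"])] += 1
    return counts


if __name__ == "__main__":
    # Simulate the repetition by feeding the sample three times.
    for (obj, message), n in count_repeats(SAMPLE * 3).most_common():
        print(n, obj, message)
```

To run it against a real log, pass an open file instead of SAMPLE, for example count_repeats(open(path)) with path pointing at the mom log file for the day in question.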

