[torquedev] [Bug 218] Jobs getting stuck in exiting "job recycled into exiting on SIGNULL/KILL"

bugzilla-daemon at supercluster.org
Wed Oct 10 21:36:37 MDT 2012


http://www.clusterresources.com/bugzilla/show_bug.cgi?id=218

--- Comment #1 from Chris Samuel <chris at csamuel.org> 2012-10-10 21:36:36 MDT ---
I can confirm this is still happening with the latest RHEL 5.8 updates. We did
a full reinstall of the affected cluster last week, but it's still happening:

[root@merri-m ~]# xdsh compute -v 'fgrep -h "job recycled into exiting" /var/spool/torque/mom_logs/201210*' | awk -F\; '{print $NF}' | sort | uniq -c
      2 job recycled into exiting on SIGNULL/KILL from substate 42
      1 job recycled into exiting on SIGNULL/KILL from substate 50
     19 job recycled into exiting on SIGNULL/KILL from substate 57
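
For anyone wanting to check a single node without xdsh, the same tally can be
sketched as a small shell function. This is only a sketch: the function name
tally_recycled is made up for illustration, and it assumes the message is the
last ';'-separated field of each pbs_mom log line, as in the pipeline above.
grep -hF is used as the equivalent of fgrep -h.

```shell
#!/bin/sh
# Count "job recycled into exiting" messages per distinct message text.
# tally_recycled is a hypothetical helper name, not part of TORQUE.
tally_recycled() {
    # "$@": one or more pbs_mom log files to scan.
    # -h suppresses file names, -F matches the pattern as a fixed string.
    grep -hF "job recycled into exiting" "$@" |
        awk -F';' '{print $NF}' |   # keep only the last ';'-separated field
        sort |                      # group identical messages together
        uniq -c                     # prefix each distinct message with its count
}

# Example call on one node (path taken from the command above):
#   tally_recycled /var/spool/torque/mom_logs/201210*
```

Each output line is a count followed by the message text, so the substate
that dominates on a given node stands out at a glance.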

Any ideas please? It's driving us (and our users) nuts.


