[torquedev] [Bug 218] Jobs getting stuck in exiting "job recycled into exiting on SIGNULL/KILL"
bugzilla-daemon at supercluster.org
bugzilla-daemon at supercluster.org
Wed Oct 10 21:36:37 MDT 2012
http://www.clusterresources.com/bugzilla/show_bug.cgi?id=218
--- Comment #1 from Chris Samuel <chris at csamuel.org> 2012-10-10 21:36:36 MDT ---
I can confirm this is still happening with latest RHEL 5.8 updates. We did a
full reinstall of the affected cluster last week but it's still happening:
[root at merri-m ~]# xdsh compute -v 'fgrep -h "job recycled into exiting"
/var/spool/torque/mom_logs/201210*' | awk -F\; '{print $NF}' | sort | uniq -c
2 job recycled into exiting on SIGNULL/KILL from substate 42
1 job recycled into exiting on SIGNULL/KILL from substate 50
19 job recycled into exiting on SIGNULL/KILL from substate 57
Any ideas please? It's driving us (and our users) nuts..
--
Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.
More information about the torquedev
mailing list