[torqueusers] qdel jobs are not cleanup
"Mgr. Šimon Tóth"
SimonT at mail.muni.cz
Wed Sep 15 08:23:09 MDT 2010
> Thanks for suggestions.
> I am not using -p options and they are not mpi jobs.
> I only come to know the job stuck on node when /var space is alarming.
> I would be nice if mom having any parameter who could kill such stuck job.
As I already said, you can cleanup jobs from nodes using the momctl tool.
If the node doesn't report the job at all:
status = .....jobs=624.torque1.ics.muni.cz.....
Then it means only one thing, you spawned a process, but didn't register
it with the node, therefore when the node kills your job, it doesn't
kill the process (because she has no idea that it belongs to the job).
Mgr. Šimon Tóth
-------------- next part --------------
A non-text attachment was scrubbed...
Size: 3366 bytes
Desc: S/MIME Cryptographic Signature
Url : http://www.supercluster.org/pipermail/torqueusers/attachments/20100915/c601612e/attachment.bin
More information about the torqueusers