[torqueusers] Jobs stuck in Queue

Joshua Bernstein jbernstein at penguincomputing.com
Thu Oct 4 14:10:29 MDT 2007


Bill Wichser wrote:
> Are you giving it enough time to clear the data from Torque?  Sometimes 
> it takes a bit.

What would you say a "bit"? I'd imagine it would clear out after at 
least 30 seconds, if not right away.

> Also try using qsig instead of qdel for running jobs.

Whats the difference? Doesn't a qdel send a SIGKILL?

Also, the jobs are clearly getting the SIGKILL, because a ps on the node 
shows that the jobs don't exist. I'm doing a watch ps, and I can see 
that right after I issue the qdel, the processes begin to clean 
themselves up and eventually disappear from the process table.

-Joshua Bernstein
Software Engineer
Penguin Computing

More information about the torqueusers mailing list