[torqueusers] Re: kill_delay
Garrick Staples
garrick at clusterresources.com
Tue Feb 27 15:01:26 MST 2007
On Tue, Feb 27, 2007 at 10:09:12PM +0100, Roy Dragseth alleged:
> Probably a good idea, but who is initiating the SIGKILL eventually? The
> scheduler or pbs_server?
pbs_mom can detect a variety of overlimit conditions, and initiate the
process of killing off the job.
pbs_server, itself, never initiates such things.
The scheduler, specifically maui and moab, can 'qdel' a job for whatever
policy reason.
Once a qdel is sent to pbs_server, it instructions pbs_mom to SIGTERM
all processes, and then if the job hasn't exited within kill_delay
seconds, instructions pbs_mom to SIGKILL processes. And if the
"job_nanny" server attribute is enabled, it will continue to send
SIGKILLs as long as the job hasn't exited.
More information about the torqueusers
mailing list