2.4 Job Preemption
TORQUE supports job preemption by allowing authorized users to suspend and resume jobs. This is supported using one of two methods. If the node supports OS-level preemption, TORQUE will recognize that during the configure process and enable it. Otherwise, the MOM may be configured to launch a custom checkpoint script in order to support preempting a job. Using a custom checkpoint script requires that the job understand how to resume itself from a checkpoint after the preemption occurs.
Configuring a Checkpoint Script on a MOM
To configure the MOM to support a checkpoint script, the
$pbsserver node06 $logevent 255 $restricted *.mycluster.org $checkpoint_script /opt/moab/tools/mom-checkpoint.sh
The second thing that must be done to enable the checkpoint script is to change the value of
#define MOM_CHECKPOINT 1
|© 2001-2010 Adaptive Computing Enterprises, Inc.|