[torqueusers] Force a job to rerun after mom has crashed
knielson at adaptivecomputing.com
Wed Aug 24 16:03:02 MDT 2011
----- Original Message -----
> From: "David Sheen" <sheen at usc.edu>
> To: "Ken Nielson" <knielson at adaptivecomputing.com>
> Cc: "Mahmood Naderan" <nt_mahmood at yahoo.com>, "Torque Users Mailing List" <torqueusers at supercluster.org>
> Sent: Wednesday, August 24, 2011 2:53:25 PM
> Subject: Re: [torqueusers] Force a job to rerun after mom has crashed
> The node has been taken offline by the administrator for testing.
That is not good practice when MOMs are running jobs. However, you can still run pbs_mom -q when it restarts. I am not sure whether the job will still be at the server, though. If it is not, the job is lost.