[torqueusers] Force a job to rerun after mom has crashed

Ken Nielson knielson at adaptivecomputing.com
Wed Aug 24 16:03:02 MDT 2011



----- Original Message -----
> From: "David Sheen" <sheen at usc.edu>
> To: "Ken Nielson" <knielson at adaptivecomputing.com>
> Cc: "Mahmood Naderan" <nt_mahmood at yahoo.com>, "Torque Users Mailing List" <torqueusers at supercluster.org>
> Sent: Wednesday, August 24, 2011 2:53:25 PM
> Subject: Re: [torqueusers] Force a job to rerun after mom has crashed
> Ken,
> 
> The node has been taken offline by the administrator for testing.
> 
> David
> 
> 

Not a good practice with MOMs running jobs. However, you can still run the pbs_mom -q when it restarts. But I am not sure if the job will still be at the server or not. If it is not at the server then the job is lost.

Ken


More information about the torqueusers mailing list