[torqueusers] Force a job to rerun after mom has crashed

"Mgr. Šimon Tóth" toth at fi.muni.cz
Wed Aug 24 09:27:45 MDT 2011


> Is there any straightforward way to force a job to rerun on a
> different node after its MOM has crashed?

This is a PBS Pro feature not supported in Torque.

But in Torque, a crashed pbs_mom does not by itself mean that the jobs
on that node are lost. Once the pbs_mom process is restarted, it will
detect the jobs that are still running and reattach to them.
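
A minimal sketch of that restart, assuming pbs_mom and qstat are on the
PATH and the function is run as root on the affected compute node. The
function name and the job ID in the usage line are hypothetical. The -p
flag asks pbs_mom to leave previously running jobs alone and monitor them
again; the pbs_mom man page describes this as the default behaviour, so
the flag is spelled out here only to make the intent explicit.

```shell
#!/bin/sh
# Sketch only: restart pbs_mom without killing the jobs it was running,
# then print the job's state as reported by pbs_server.
# Assumption: pbs_mom and qstat are installed and on PATH.
restart_mom_keep_jobs() {
    job_id="$1"    # hypothetical job ID, e.g. 1234.headnode

    # -p: jobs that were running when the MOM went down keep running
    pbs_mom -p || return 1

    # The full status listing contains a job_state line; R means running
    qstat -f "$job_id" | grep 'job_state'
}

# Usage (hypothetical job ID):
#   restart_mom_keep_jobs 1234.headnode
```

If the job_state line still shows R after the restart, the job has been
picked up again and no rerun is needed.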

-- 
Mgr. Simon Toth

