[torqueusers] Preventing jobs to be re-runed when
david.mcgiven at fusemail.com
Mon Mar 13 13:16:13 MST 2006
Dear TORQUE users,
I was running a job in one of my cluster nodes. Due to an electrical
problem the node was suddenly and unexpectedly rebooted.
While it was rebooting, the job was marked with an "E" when issuing qstat
command. One minute after or so, when the node came back to normal
operation, the job was "R" again. The system had automatically started the
How can I prevent this from happening?
It's very dangerous because not all the jobs are meant to be resumed
"automatically" and they might overwritte the already processed data.
I'm using TORQUE + Maui.
Thanks in advance.
More information about the torqueusers