[torqueusers] Preventing jobs to be re-runed when

David McGiven david.mcgiven at fusemail.com
Mon Mar 13 13:16:13 MST 2006


Dear TORQUE users,

I was running a job in one of my cluster nodes. Due to an electrical
problem the node was suddenly and unexpectedly rebooted.

While it was rebooting, the job was marked with an "E" when issuing qstat
command. One minute after or so, when the node came back to normal
operation, the job was "R" again. The system had automatically started the
job again.

How can I prevent this from happening?

It's very dangerous because not all the jobs are meant to be resumed
"automatically" and they might overwritte the already processed data.

I'm using TORQUE + Maui.

Thanks in advance.

Regards,
David McGiven


More information about the torqueusers mailing list