[torqueusers] Question About Desired Behavior
glen.beane at gmail.com
Tue Mar 26 11:33:30 MDT 2013
On Tue, Mar 26, 2013 at 12:41 PM, David Beer
<dbeer at adaptivecomputing.com> wrote:
> Our QA tests have exposed a case we don't handle: a job file is loaded
> saying that its state is running, but no exec host list is defined. We then
> attempt to perform actions on the job that assume it is running, but we
> can't talk to the mom because we don't know which mom it is.
> I can think of two different behaviors:
> 1. delete the job
> 2. requeue the job
> Which one would you all prefer?
How does a job get into this state in the first place?