[torqueusers] Question About Desired Behavior

Glen Beane glen.beane at gmail.com
Tue Mar 26 11:33:30 MDT 2013


On Tue, Mar 26, 2013 at 12:41 PM, David Beer
<dbeer at adaptivecomputing.com> wrote:
> All,
>
> Our QA tests have exposed that when a job file is loaded saying that it's
> state is running but there is no exec host list defined we don't handle this
> state, that is, we attempt to perform actions on the job that assume it is
> running, but we can't talk to the mom because we don't know what mom it is.
> I can think of two different behaviors:
>
> 1. delete the job
> 2. requeue the job
>
> Which one would you all prefer?


how does a job get into this state in the first place?


More information about the torqueusers mailing list