[torqueusers] recovery behavior question
jwang at dataseekonline.com
Thu Feb 14 11:12:46 MST 2008
So you're stopping the pbs_mom daemon on the compute nodes to prevent jobs
from running on them?
That had been the practice here as well. It just seems to me that we
shouldn't have to use such work arounds.
On 2/13/08 8:48 PM, "Tim Freeman" <tfreeman at mcs.anl.gov> wrote:
> If I submit a job with all moms in the pool in the 'down' state, the job sits
> in the queue as expected. Then I bring up a node but the job in the queue is
> not run until I submit another job (and they both do run).
> Is this expected? Is there a setting I am missing to get around this?
> (Torque 2.2.1, Maui 3.2.6p19 but I saw this with pbs_sched too)
> torqueusers mailing list
> torqueusers at supercluster.org
More information about the torqueusers