[torqueusers] recovery behavior question

John Wang jwang at dataseekonline.com
Thu Feb 14 11:12:46 MST 2008


Hello Tim

So you're stopping the pbs_mom daemon on the compute nodes to prevent jobs
from running on them?

That had been the practice here as well.   It just seems to me that we
shouldn't have to use such work arounds.

Regards,
John


On 2/13/08 8:48 PM, "Tim Freeman" <tfreeman at mcs.anl.gov> wrote:

> If I submit a job with all moms in the pool in the 'down' state, the job sits
> in the queue as expected.  Then I bring up a node but the job in the queue is
> not run until I submit another job (and they both do run).
> 
> Is this expected?  Is there a setting I am missing to get around this?
> 
> (Torque 2.2.1, Maui 3.2.6p19 but I saw this with pbs_sched too)
> 
> Thankyou,
> Tim
> _______________________________________________
> torqueusers mailing list
> torqueusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/torqueusers



More information about the torqueusers mailing list