[torqueusers] dependencies on completed jobs
Alexander Saydakov
saydakov at yahoo-inc.com
Tue Sep 5 15:58:41 MDT 2006
> -----Original Message-----
> From: torqueusers-bounces at supercluster.org [mailto:torqueusers-bounces at supercluster.org] On Behalf Of Garrick Staples
> Sent: Tuesday, September 05, 2006 2:40 PM
> To: torqueusers at supercluster.org
> Subject: Re: [torqueusers] dependencies on completed jobs
>
> On Tue, Sep 05, 2006 at 12:45:27PM -0700, Sam Rash alleged:
> > So we've noticed that if we submit job A, then submit job B which depends on
> > A (-W depend=afterok:A_job_id), and A has already completed (we have
> > keep_completed set to at least 30 min), B gets stuck in the hold state. Is
> > this intentional? Or a bug?
> >
> >
> >
> > It seems like B should surely run.
> >
> > (maybe A updates its dependents when it completes and B won't check
> > explicitly?)
>
> The simple test works fine for me.
>
> [garrick at hpcjr-master garrick]$ echo sleep 60 | qsub
> 81384.hpcjr-master.usc.edu
> [garrick at hpcjr-master garrick]$ echo sleep 60 | qsub -W depend=afterok:81384
> 81385.hpcjr-master.usc.edu
> [garrick at hpcjr-master garrick]$ echo sleep 60 | qsub -W depend=afterok:81385
> 81386.hpcjr-master.usc.edu
>
> After 1.5 minutes:
> 81384.hpcjr-master.u garrick batch STDIN 10643 1 -- -- 01:00 C 00:00
> 81385.hpcjr-master.u garrick batch STDIN 10804 1 -- -- 01:00 R --
> 81386.hpcjr-master.u garrick batch STDIN -- 1 -- -- 01:00 H --
>
> And after 2.5 minutes:
> 81384.hpcjr-master.u garrick batch STDIN 10643 1 -- -- 01:00 C 00:00
> 81385.hpcjr-master.u garrick batch STDIN 10804 1 -- -- 01:00 C 00:00
> 81386.hpcjr-master.u garrick batch STDIN 10910 1 -- -- 01:00 R --
I am afraid you did not get the point. Try submitting the second job after
the first one has finished.
In general, you never know whether the first job has finished or not, even if
you checked just a second ago.
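
Something like the following rough sketch should reproduce the sequence I mean. It assumes keep_completed is set on the server, so that a finished job is still reported in state C by "qstat -f". The helper names job_state and repro are only illustrative, and the sleep times are arbitrary.

```shell
#!/bin/sh
# Reproduction sketch: submit job B only after job A has already completed.
# Assumes qsub and qstat are on PATH and keep_completed is set, so a finished
# job still shows up with job_state = C. Without keep_completed the wait loop
# below would never see state C.

# Print the one-letter state of a job (Q, R, H, C, ...) from "qstat -f".
job_state() {
    qstat -f "$1" | sed -n 's/^ *job_state = //p'
}

# Submit A, wait until it is reported complete, then submit B depending on A.
repro() {
    a=$(echo "sleep 10" | qsub) || return 1
    until [ "$(job_state "$a")" = "C" ]; do
        sleep 5
    done
    b=$(echo "sleep 10" | qsub -W depend=afterok:"$a") || return 1
    sleep 5
    echo "A=$a B=$b state_of_B=$(job_state "$b")"
}
```

Calling repro prints the two job ids and the state of B a few seconds after submission. If B is reported as H at that point and stays there, that is the behaviour Sam described.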