[Mauiusers] Return code of job

Garrick Staples garrick at clusterresources.com
Thu Jun 29 10:47:59 MDT 2006


On Thu, Jun 29, 2006 at 08:57:34AM -0700, Keenahn Jung alleged:
> Hello! Thanks for your quick reply. However, that does not answer my
> question. Could you perhaps point me to the lines of code in Maui where
> it detects that a job has failed and sends a message back to the
> resource manager?

I don't think maui has that feature.  Probably need moab for that.

Greping through maui's source code for 'exit_status' and 'ATTR_exitstat'
doesn't find anything.

 
> Thanks, K
> 
> -----Original Message-----
> From: mauiusers-bounces at supercluster.org
> [mailto:mauiusers-bounces at supercluster.org] On Behalf Of Garrick Staples
> Sent: Wednesday, June 28, 2006 1:22 PM
> To: mauiusers at supercluster.org
> Subject: Re: [Mauiusers] Return code of job
> 
> On Tue, Jun 27, 2006 at 05:44:30PM -0700, Keenahn Jung alleged:
> > >From the documentation:
> > 
> > " Maui supports two other types of holds. The first is a temporary
> hold
> > known as a 'defer'. A job is deferred if the scheduler determines that
> > it cannot run. This can be because it asks for resources which do not
> > currently exist, does not have allocations to run, is rejected by the
> > resource manager, repeatedly fails after start up, etc."
> > 
> > How does Maui determine that the job has failed? I assume that it
> > somehow has access to the return code of the job, is this correct? If
> > this is so, how can I retrieve the return code? Thank you!
> 
> It depends on which resource manager you are using.
> 
> If you are using TORQUE, enable the "keep_completed" feature and you can
> read the "exit_status" attribute from a job stat.
> 
> _______________________________________________
> mauiusers mailing list
> mauiusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/mauiusers


More information about the mauiusers mailing list