[torqueusers] torque-1.2.0p6 - massive emails and job nanny
Simon Gao
gao at schrodinger.com
Wed Sep 21 16:16:48 MDT 2005
When I tried enabling job_nanny, I got the following error:
Qmgr: set server job_nanny=T
qmgr: Syntax error - cannot locate attribute
set server job_nanny=T
^
The torque version is 1.2.0p6. Are there parameters required when
compiling torque to add the attribute?
Simon Gao
Garrick Staples wrote:
>On Mon, Sep 19, 2005 at 04:27:56PM -0700, Tony Vu alleged:
>
>
>>Hello,
>>
>>Like some people on this list, our users have received multiple
>>emails in the past when their jobs completed. We just recently
>>upgraded to patch 6 and we are still seeing this problem. After
>>browsing through this list, I read that the attribute "job_nanny"
>>needs to be turned on to alleviate this problem since by default it
>>is not set.
>>
>>From what I understand, Torque will continually send multiple kill/
>>cancel/delete signals to an exiting job if for some reason it cannot
>>communicate with the mother superior node on the initial try. Is
>>this correct? If I set the job_nanny attribute to true will only the
>>initial job delete signal be acknowledged and subsequent ones be
>>ignored? Is this an option that needs to be turned on before
>>compiling Torque in the configure script or is support for it
>>compiled in by default?
>>
>>Also, is a server restart required if this server attribute is set or
>>is it dynamic?
>>
>>
>
>Yes, job_nanny causes subsequent deletes to be ignored. It's not a
>compile-time option, just type 'set server job_nanny=T' into qmgr. It
>does not require a server restart.
>
>
>
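The qmgr step discussed above can be scripted so that a rejection such as the "cannot locate attribute" error is caught instead of going unnoticed. This is only a sketch under some assumptions: qmgr is on the PATH, the caller has manager privileges on the server, and qmgr exits non-zero when it rejects a command. The function name enable_job_nanny is made up for illustration and is not part of torque.

```shell
#!/bin/sh
# Sketch: try to enable job_nanny and report whether the server accepted it.
# Assumes qmgr is on PATH and exits non-zero when a command is rejected.

enable_job_nanny() {
    # Same command as in the thread, passed non-interactively with -c.
    if qmgr -c 'set server job_nanny=T'; then
        # Show the attribute as the server now reports it.
        qmgr -c 'print server' | grep job_nanny
    else
        # Hypothetical message; the real cause depends on the installed build.
        echo "job_nanny was rejected; this build may not know the attribute" >&2
        return 1
    fi
}
```

Calling enable_job_nanny on the server host would then either print the job_nanny line from the server configuration or return a non-zero status, which is the case reported at the top of this thread.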