[torqueusers] server change down state when detect message error

jupiter jupiter.hce at gmail.com
Fri Mar 28 17:27:39 MDT 2014


Thanks David and Rick, that indeed resolves the issue, great
appreciate your help.

jupiter

On 3/29/14, David Beer <dbeer at adaptivecomputing.com> wrote:
> You can also set this on pbs_server if you want it to apply to all moms in
> your system:
>
> qmgr -c 'set server down_on_error=true'
>
>
> On Fri, Mar 28, 2014 at 11:51 AM, Rick McKay
> <rmckay at adaptivecomputing.com>wrote:
>
>> I'm not too familiar with Maui, but $down_on_error is specific to TORQUE.
>> To enable it, put the following into your $PBSHOME/mom_priv/config file:
>>
>> $down_on_error true
>>
>> I don't know the exact version that was enabled. I did a quick check and
>> confirmed that it got added somewhere between 2.5.3 and 2.5.9. Maybe
>> someone else here can speak to why the documentation still calls it
>> experimental. I did some tests and it appears to function as advertised.
>>
>> --Rick
>>
>>
>> On Thu, Mar 27, 2014 at 8:49 PM, jupiter <jupiter.hce at gmail.com> wrote:
>>
>>> I am using maui, so it does not implement in maui?
>>>
>>> Thanks Rick.
>>>
>>> jupiter
>>>
>>>
>> _______________________________________________
>> torqueusers mailing list
>> torqueusers at supercluster.org
>> http://www.supercluster.org/mailman/listinfo/torqueusers
>>
>>
>
>
> --
> David Beer | Senior Software Engineer
> Adaptive Computing
>


More information about the torqueusers mailing list