[Mauiusers] Maui is unexpectedly down
soo5 at fireworks.kist.re.kr
Mon Aug 15 21:11:01 MDT 2005
Thank you for your help.
As you suggested, I changed some configuration settings in the maui.cfg file as follows.
Since then, maui has worked fine.
RMCFG[head node] TIMEOUT=30
LOGLEVEL 9 (advised by Mr. Garrick)
But I cannot find any explanation of, or guidelines for, the
'RMCFG TIMEOUT' and 'JOBAGGREGATIONTIME' variables
in the admi~.pdf file or on the web site.
In my opinion, the most important variable is 'RMPOLLINTERVAL'.
I really appreciate your help.
From: mcgregor at fnal.gov
To: 오정수 <soo5 at fireworks.kist.re.kr>
Cc: mauiusers at supercluster.org
Date: Thu, 11 Aug 2005 16:28:22 -0600
Subject: Re: [Mauiusers] Maui is unexpectedly down
> I did two things. Firstly, I installed monit
> (http://www.tildeslash.com/monit/) to monitor maui and other critical
> services as a safety net. I also made the following changes to maui's
> configuration:
> RMCFG[head node] TIMEOUT=30
> JOBAGGREGATIONTIME 00:00:10
> RMPOLLINTERVAL 00:02:30
> which I think I just had as defaults before.
> So far (since July 29th) monit has not had to restart maui, so perhaps
> these changes did the trick.
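
For reference, the settings mentioned in this thread can be collected into a single maui.cfg fragment. This is only a sketch: the resource manager name "base" is a placeholder that does not come from the thread, and the comments reflect a general understanding of what each parameter does rather than text from the Maui documentation, so check them against your own installation.

```
# maui.cfg (fragment): sketch collecting the settings from this thread.
# "base" is a hypothetical resource manager name; substitute your own.

# How long (in seconds, as commonly understood) maui waits for the
# resource manager to respond before giving up on a request.
RMCFG[base]         TIMEOUT=30

# Wait 10 seconds after a job event before acting on it, so that
# several events arriving together are handled in one pass.
JOBAGGREGATIONTIME  00:00:10

# Poll the resource manager every 2 minutes 30 seconds.
RMPOLLINTERVAL      00:02:30

# Most verbose logging; helpful while diagnosing an unexpected exit,
# but it makes the log files grow quickly.
LOGLEVEL            9
```

Once the scheduler has stayed up for a while, LOGLEVEL can probably be lowered again to keep the logs manageable.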