[Mauiusers] Maui exit by itself

Garrick Staples garrick at usc.edu
Tue Jul 26 19:23:26 MDT 2005


I don't see a single alarm() call in maui, but I do see one in Torque's client
library!

My guess is this is in Torque's pbs_disconnect().


On Wed, Jul 27, 2005 at 09:03:24AM +0800, group hpc alleged:
> Hi,
> 
> The following is the output from gdb, pls help. Thanks.
> 
> (gdb) r
> Starting program: /usr/local/maui/sbin/maui
> Detaching after fork from child process 5014.
> Detaching after fork from child process 5479.
> Detaching after fork from child process 5904.
> Detaching after fork from child process 6320.
> Detaching after fork from child process 6733.
> Detaching after fork from child process 7156.
> Detaching after fork from child process 7561.
> Detaching after fork from child process 7974.
> Detaching after fork from child process 8393.
> Detaching after fork from child process 8816.
> Detaching after fork from child process 9241.
> Detaching after fork from child process 9666.
> Detaching after fork from child process 10565.
> Detaching after fork from child process 10980.
> Detaching after fork from child process 11393.
> Detaching after fork from child process 11810.
> Detaching after fork from child process 12221.
> Detaching after fork from child process 12642.
> Detaching after fork from child process 13068.
> Detaching after fork from child process 13369.
> 
> Program terminated with signal SIGALRM, Alarm clock.
> The program no longer exists.
> (gdb) where
> No stack.
> 
> Best Regards,
> Josh
> 
> > 
> > I have seen maui exit with exactly the same error message. I would really appreciate any advice on this matter.
> > 
> > Gordon.
> > 
> > ----- Original Message -----
> > From: group hpc <hpc.group at gmail.com>
> > Date: Thursday, July 14, 2005 7:50 pm
> > Subject: [Mauiusers] Maui exit by itself
> > 
> > > Hi,
> > >
> > > I would like to find out why the maui scheduler suddenly exit by
> > > itself.Does anyone know how to resolve this problem? The following
> > > is a last
> > > log messgae before it exits.
> > >
> > > 07/14 14:11:06 MSURecvPacket(10,BufP,4,NULL,100000)
> > > 07/14 14:11:07 ServerProcessRequests()
> > > 07/14 14:11:07 INFO:     not rolling logs (6889559 < 10000000)
> > > 07/14 14:11:07 MResAdjust(NULL,0,0)
> > > 07/14 14:11:07 MStatInitializeActiveSysUsage()
> > > 07/14 14:11:07 MStatClearUsage([NONE],Active)
> > > 07/14 14:11:07 ServerUpdate()
> > > 07/14 14:11:07 MSysUpdateTime()
> > > 07/14 14:11:07 INFO:     starting iteration 1215
> > > 07/14 14:11:07 MRMGetInfo()
> > > 07/14 14:11:07 MClusterClearUsage()
> > > 07/14 14:11:07 MRMClusterQuery()
> > > 07/14 14:11:07 MPBSClusterQuery(META,RCount,SC)
> > > 07/14 14:11:07 ERROR:    cannot get node info: NULL
> > >
> > > Thanks.

-- 
Garrick Staples, Linux/HPCC Administrator
University of Southern California
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
Url : http://www.supercluster.org/pipermail/mauiusers/attachments/20050726/e255a8e0/attachment.bin


More information about the mauiusers mailing list