[Mauiusers] maui crash in pbs_statfree

Garrick Staples garrick at usc.edu
Mon Jan 9 15:29:10 MST 2006


On Mon, Jan 09, 2006 at 04:55:31PM -0500, Andrew J Caird alleged:
> 
> We just started seeing maui crashes recently.  We're running 
> maui-3.2.6p14-snap.1126543721 and the crash happens:

Just recently?  Has maui or PBSPro been updated recently?

What is the last thing in maui's log?

The segfault is inside of the PBS client libs, so you'd need to figure
out if maui is corrupting memory, or if the problem is in PBSPro.

I'd run maui under valgrind first.  And of course, try a more recent
maui snapshot.

 
> Starting program: /usr/local/maui-3.2.6p14-snap/sbin/maui -d
> 
> Program received signal SIGSEGV, Segmentation fault.
> 0x0000002a9583848f in chunk_free () from /lib64/libc.so.6
> (gdb) where
> #0  0x0000002a9583848f in chunk_free () from /lib64/libc.so.6
> #1  0x0000002a958383b6 in free () from /lib64/libc.so.6
> #2  0x00000000400ffb3d in pbs_statfree (bsp=0x42f861f0)
>     at /home/proett/PBSPro_5.4.1.41671/pbs/src/lib/Libifl/pbs_statfree.c:43
> #3  0x00000000400cbe8e in MPBSWorkloadQuery (R=0x41bbe440, 
> JCount=0x7fbffecc04, SC=0x0) at MPBSI.c:894
> #4  0x0000000040082bc7 in __MUTFunc (V=0x7fbffeca40) at MUtil.c:4717
> #5  0x0000000040082b34 in MUThread (F=0x400cb6b1 <MPBSWorkloadQuery>, 
> TimeOut=9, RC=0x7fbffecc08,
>     ACount=3, Lock=0x0) at MUtil.c:4690
> #6  0x00000000400c1de1 in MRMWorkloadQuery (WCount=0x7fbffed054, SC=0x0) at 
> MRM.c:595
> #7  0x00000000400c175c in MRMGetInfo () at MRM.c:364
> #8  0x000000004003db0d in MSchedProcessJobs (OldDay=0x7fbffff110 "Mon", 
> GlobalSQ=0x7fbfffb110,
>     GlobalHQ=0x7fbfff7110) at MSched.c:6823
> #9  0x000000004000358a in main (ArgC=2, ArgV=0x7fbffff1c8) at Server.c:189
> #10 0x0000002a957ea087 in __libc_start_main () from /lib64/libc.so.6
> #11 0x0000000040002d3a in _start ()
> 
> 
> I haven't looked beyond that, but figured the sooner I got this in front 
> of more eyes the better.
> 
> Thanks.
> --andy
> _______________________________________________
> mauiusers mailing list
> mauiusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/mauiusers

-- 
Garrick Staples, Linux/HPCC Administrator
University of Southern California
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
Url : http://www.supercluster.org/pipermail/mauiusers/attachments/20060109/13330005/attachment.bin


More information about the mauiusers mailing list