[torqueusers] pbs_sched crash

Alexander Saydakov saydakov at yahoo-inc.com
Wed Mar 22 12:04:08 MST 2006


Last night pbs_sched crashed, leaving our 70+ nodes idle all night long :(

 

-rw-------  1 root  wheel  1612468224 Mar 21 23:07 pbs_sched.core

 

Note the size: about 1.6 GB!

 

We are running TORQUE 2.0.0p7.

 

> gdb pbs_sched pbs_sched.core
GNU gdb 4.18 (FreeBSD)
Copyright 1998 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "i386-unknown-freebsd"...Deprecated bfd_read called at /home/src/gnu/usr.bin/binutils/gdb/../../../../contrib/gdb/gdb/dbxread.c line 2627 in elfstab_build_psymtabs
Deprecated bfd_read called at /home/src/gnu/usr.bin/binutils/gdb/../../../../contrib/gdb/gdb/dbxread.c line 933 in fill_symbuf

Core was generated by `pbs_sched'.
Program terminated with signal 11, Segmentation fault.
Reading symbols from /usr/lib/libkvm.so.2...done.
Reading symbols from /usr/lib/libc.so.4...done.
Reading symbols from /usr/libexec/ld-elf.so.1...done.
#0  0x1013c8e in pbs_rescquery (c=0, resclist=0x9fbff484, num_resc=1, available=0x9fbff498, allocated=0x9fbff494, reserved=0x9fbff490, down=0x9fbff48c)
    at ./../Libifl/pbsD_resc.c:218
218           *(available + i) = *(reply->brp_un.brp_rescq.brq_avail + i);
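
The faulting line copies element i of the reply's brq_avail array into the caller's available array. A segfault there means one of the pointers involved (available, reply, or brq_avail) was invalid, or i ran past the end of an array; the frame above does not show which. As a rough sketch only, here is what a guarded version of such a copy could look like. The struct and function names below (rescq_reply, copy_avail) are hypothetical stand-ins made up for illustration, not the actual TORQUE types or API:

```c
#include <stddef.h>

/* Hypothetical stand-in for the resource-query part of a batch reply.
 * The real TORQUE structures are NOT reproduced here. */
struct rescq_reply {
    int  num;    /* number of entries the server actually returned */
    int *avail;  /* per-resource "available" counts; may be NULL */
};

/* Copy num_resc counts from the reply into the caller's array.
 * Returns 0 on success, or -1 if a pointer is NULL or the reply
 * holds fewer entries than the caller asked for. */
static int copy_avail(const struct rescq_reply *reply, int num_resc,
                      int *available)
{
    int i;

    if (reply == NULL || reply->avail == NULL || available == NULL)
        return -1;
    if (num_resc < 0 || reply->num < num_resc)
        return -1;

    for (i = 0; i < num_resc; i++)
        *(available + i) = *(reply->avail + i);
    return 0;
}
```

With checks like these a bad reply would surface as an error return rather than a crash of the scheduler. Whether that is the right fix for pbsD_resc.c depends on why the reply was bad in the first place, which the core file should be able to tell us.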

 
