[torqueusers] pbs_server crash

Tom Rudwick tomr at intrinsity.com
Mon Aug 25 13:23:35 MDT 2008


We are running TORQUE 2.3.3 on Red Hat Enterprise Linux 4 Update 4
and are getting pbs_server crashes.
We do have routing queues, and the backtrace below suggests the
server is in the routing code when it crashes.

Does any of this ring any bells for anyone?

Thanks,
Tom


[ty]# gdb /usr/sbin/pbs_server core
GNU gdb Red Hat Linux (6.3.0.0-1.132.EL4rh)
Copyright 2004 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "x86_64-redhat-linux-gnu"...
warning: not using untrusted file "/export/home/tomr/.gdbinit"
(no debugging symbols found)
Using host libthread_db library "/lib64/tls/libthread_db.so.1".

Core was generated by `/usr/sbin/pbs_server'.
Program terminated with signal 6, Aborted.
Reading symbols from /usr/lib/libtorque.so.2...done.
Loaded symbols for /usr/lib/libtorque.so.2
Reading symbols from /lib64/tls/libc.so.6...done.
Loaded symbols for /lib64/tls/libc.so.6
Reading symbols from /lib64/ld-linux-x86-64.so.2...done.
Loaded symbols for /lib64/ld-linux-x86-64.so.2
Reading symbols from /lib64/libnss_files.so.2...done.
Loaded symbols for /lib64/libnss_files.so.2
Reading symbols from /lib64/libnss_nis.so.2...done.
Loaded symbols for /lib64/libnss_nis.so.2
Reading symbols from /lib64/libnsl.so.1...done.
Loaded symbols for /lib64/libnsl.so.1
Reading symbols from /lib64/libnss_dns.so.2...done.
Loaded symbols for /lib64/libnss_dns.so.2
Reading symbols from /lib64/libresolv.so.2...done.
Loaded symbols for /lib64/libresolv.so.2
#0  0x0000003298c2e21d in raise () from /lib64/tls/libc.so.6
(gdb) where
#0  0x0000003298c2e21d in raise () from /lib64/tls/libc.so.6
#1  0x0000003298c2fa1e in abort () from /lib64/tls/libc.so.6
#2  0x00000000004111a4 in catch_abort ()
#3  <signal handler called>
#4  0x0000000000427513 in svr_dequejob ()
#5  0x0000000000429be8 in svr_movejob ()
#6  0x000000000040af82 in default_router ()
#7  0x000000000041dee6 in req_commit ()
#8  0x0000000000414e07 in dispatch_request ()
#9  0x0000000000415551 in process_request ()
#10 0x0000002a95576ca6 in wait_request (waittime=Variable "waittime" is not available.
) at ../Libnet/net_server.c:451
#11 0x0000000000414173 in main ()
(gdb) up


