[torqueusers] problem on torque
Garrick Staples
garrick at usc.edu
Tue Oct 11 09:34:39 MDT 2005
On Tue, Oct 11, 2005 at 05:03:44PM +0800, ?????? alleged:
> hello everyone:
>
> I use TORQUE 1.20P6,and on redhat LINUX9.0, there is a problem,when I run torque for a time, the daemon pbs_server is crashed, and I use GDB to debug it ,it report like this:
>
> Core was generated by `pbs_server'.
> Program terminated with signal 11, Segmentation fault.
> Reading symbols from /lib/libdl.so.2...done.
> Loaded symbols for /lib/libdl.so.2
> Reading symbols from /lib/tls/libc.so.6...done.
> Loaded symbols for /lib/tls/libc.so.6
> Reading symbols from /lib/ld-linux.so.2...done.
> Loaded symbols for /lib/ld-linux.so.2
> Reading symbols from /lib/libnss_files.so.2...done.
> Loaded symbols for /lib/libnss_files.so.2
> Reading symbols from /lib/libnss_nis.so.2...done.
> Loaded symbols for /lib/libnss_nis.so.2
> Reading symbols from /lib/libnsl.so.1...done.
> Loaded symbols for /lib/libnsl.so.1
> #0 0x08066325 in poll_job_task (ptask=0x80d23e0) at req_stat.c:627
> 627 if (server.sv_attr[(int)SRV_ATR_PollJobs].at_val.at_long &&
> (gdb) where
> #0 0x08066325 in poll_job_task (ptask=0x80d23e0) at req_stat.c:627
> #1 0x0806c198 in dispatch_task (ptask=0x80d23e0) at svr_task.c:198
> #2 0x08056530 in next_task () at pbsd_main.c:1288
> #3 0x08055e78 in main (argc=1, argv=0xbfffda84) at pbsd_main.c:1031
> #4 0x42015574 in __libc_start_main () from /lib/tls/libc.so.6
> (gdb) quit
>
> I send a lot of job , and when run about 1000-2000jobs , there is a crash
>
> please help me , thank you
I believe this was already fixed in a p7 snapshot.
--
Garrick Staples, Linux/HPCC Administrator
University of Southern California
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
Url : http://www.supercluster.org/pipermail/torqueusers/attachments/20051011/cb653172/attachment.bin
More information about the torqueusers
mailing list