[torqueusers] problem on torque

=?gb2312?B?ydDD8ce/?= shangmq at bgp.com.cn
Tue Oct 11 03:03:44 MDT 2005


hello everyone:

     I use TORQUE 1.20P6,and on redhat LINUX9.0, there is a problem,when I run torque for a time, the daemon pbs_server is crashed, and I use GDB to debug it ,it report like this:

Core was generated by `pbs_server'.
Program terminated with signal 11, Segmentation fault.
Reading symbols from /lib/libdl.so.2...done.
Loaded symbols for /lib/libdl.so.2
Reading symbols from /lib/tls/libc.so.6...done.
Loaded symbols for /lib/tls/libc.so.6
Reading symbols from /lib/ld-linux.so.2...done.
Loaded symbols for /lib/ld-linux.so.2
Reading symbols from /lib/libnss_files.so.2...done.
Loaded symbols for /lib/libnss_files.so.2
Reading symbols from /lib/libnss_nis.so.2...done.
Loaded symbols for /lib/libnss_nis.so.2
Reading symbols from /lib/libnsl.so.1...done.
Loaded symbols for /lib/libnsl.so.1
#0  0x08066325 in poll_job_task (ptask=0x80d23e0) at req_stat.c:627
627       if (server.sv_attr[(int)SRV_ATR_PollJobs].at_val.at_long && 
(gdb) where
#0  0x08066325 in poll_job_task (ptask=0x80d23e0) at req_stat.c:627
#1  0x0806c198 in dispatch_task (ptask=0x80d23e0) at svr_task.c:198
#2  0x08056530 in next_task () at pbsd_main.c:1288
#3  0x08055e78 in main (argc=1, argv=0xbfffda84) at pbsd_main.c:1031
#4  0x42015574 in __libc_start_main () from /lib/tls/libc.so.6
(gdb) quit

I send a lot of job , and when run about 1000-2000jobs , there is a crash

please help me , thank you

            smq


¡¡¡¡¡¡¡¡¡¡¡¡¡¡¡¡shangmq at bgp.com.cn
¡¡¡¡¡¡¡¡¡¡¡¡¡¡¡¡¡¡¡¡2005-10-11 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: fox.gif
Type: image/gif
Size: 9519 bytes
Desc: not available
Url : http://www.supercluster.org/pipermail/torqueusers/attachments/20051011/9e67bc06/fox.gif


More information about the torqueusers mailing list