[torqueusers] internal socket table full log message

Ken Nielson knielson at adaptivecomputing.com
Mon Mar 7 08:33:34 MST 2011


ulimit is not what is causing this problem. The error comes from an internal table full problem. In TORQUE 2.5.5 the table size is 10240. 

How busy and large is your system? Do you have lingering sockets. Try netstat and see how many open tcp connections you have. 

Ken Nielson
Adaptive Computing

----- Original Message -----
From: "\"Hung-Sheng Tsao (Lao Tsao 老曹) Ph. D.\"" <laotsao at gmail.com>
To: torqueusers at supercluster.org
Sent: Monday, March 7, 2011 3:44:19 AM
Subject: Re: [torqueusers] internal socket table full log message



what is server's ulimit -n 
by default it is set to 1024 
one can increase it, try 2048, 4096 
regards 

On 3/6/2011 11:52 PM, Abhishek Gupta wrote: 


Hi, 
We are getting this message popped up in our pbs log file: 

03/06/2011 23:13:33;0001;PBS_Server;Svr;PBS_Server;LOG_ALERT::socket_to_handle, internal socket table full (1024) - num_connections is 6 
03/06/2011 23:13:34;0001;PBS_Server;Svr;PBS_Server;LOG_ALERT::socket_to_handle, internal socket table full (1024) - num_connections is 7 


Anyone has any idea about this message? Due to this message, no one was able to run the jobs and I had restart the service. 
Thanks, 
Abhi. 
_______________________________________________
torqueusers mailing list
torqueusers at supercluster.org
http://www.supercluster.org/mailman/listinfo/torqueusers


More information about the torqueusers mailing list