[torqueusers] many server-client connections in TIME_WAIT
Ken Nielson
knielson at clusterresources.com
Tue Jun 23 08:28:01 MDT 2009
Arnau,
Do your waiting sockets change after about two to four minutes. I ask
that because the sockets will remain in a TIME_WAIT state for 2*MSL
before they are removed from the table. the MSL is typically set between
30 to 120 seconds.
http://www.developerweb.net/forum/showthread.php?t=2941
Ken Nielson
Cluster Resources, Inc.
Arnau Bria wrote:
> Hi all,
>
> our server has many connections from clients in TIME_WAIT status:
> # netstat -puta|grep pbs|wc -l
> 1071
>
> [...]
> tcp 0 0 pbs02.pic.es:748 td062.pic.es:pbs_mom TIME_WAIT -
> tcp 0 0 pbs02.pic.es:736 td062.pic.es:pbs_mom TIME_WAIT -
> tcp 0 0 pbs02.pic.es:709 td062.pic.es:pbs_mom TIME_WAIT -
> tcp 0 0 pbs02.pic.es:638 td062.pic.es:pbs_mom TIME_WAIT -
> tcp 0 0 pbs02.pic.es:918 td062.pic.es:pbs_mom TIME_WAIT -
> tcp 0 0 pbs02.pic.es:1016 td062.pic.es:pbs_mom TIME_WAIT -
> tcp 0 0 pbs02.pic.es:773 td062.pic.es:pbs_mom TIME_WAIT -
> tcp 0 0 pbs02.pic.es:924 td060.pic.es:pbs_mom TIME_WAIT -
> tcp 0 0 pbs02.pic.es:809 td060.pic.es:pbs_mom TIME_WAIT -
> tcp 0 0 pbs02.pic.es:689 td058.pic.es:pbs_mom TIME_WAIT -
> tcp 0 0 pbs02.pic.es:682 td058.pic.es:pbs_mom TIME_WAIT -
> [...]
>
> If I restart pbs_server, all of them die, but after few second I have
> 1000 conncetions again.
>
> What could be blocking the socket to be closed?
>
> # rpm -qa|grep torque
> torque-client-2.3.0-snap.200801151629.2cri.slc4
> torque-server-2.3.0-snap.200801151629.2cri.slc4
> torque-2.3.0-snap.200801151629.2cri.slc4
>
>
> TIA,
> Arnau
> _______________________________________________
> torqueusers mailing list
> torqueusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/torqueusers
>
More information about the torqueusers
mailing list