[torqueusers] mount.nfs starts failing after pbs_server gets "warmed up", starts using up more than 700 sockets (Sabuj Pattanayek)Re:

Sabuj Pattanayek sabujp at gmail.com
Sat Apr 20 18:55:42 MDT 2013


> but I reckon I'm going to have to do a recompile - will the reconfig
> mean I have to push out the pbs_mom binary to the nodes (I can see that
> being an issue), or is it just the pbs_server binary that'll need
> replacing and restarting? If it's the latter, then great.

Just the latter since pbs_server is making a connection per job and
would use up all ports < 1024 which mount.nfs also tries to use
without that configure flag. I'm seeing that pbs_mom is already using
ports > 1024 for the most part:

tcp        0      0 0.0.0.0:15002               0.0.0.0:*
     LISTEN      2492/pbs_mom
tcp        0      0 0.0.0.0:15003               0.0.0.0:*
     LISTEN      2492/pbs_mom
udp        0      0 0.0.0.0:1023                0.0.0.0:*
                 2492/pbs_mom
udp        0      0 0.0.0.0:15003               0.0.0.0:*
                 2492/pbs_mom

which is fine and won't cause any issues with nfs mounting on the nodes.

HTH,
Sabuj


More information about the torqueusers mailing list