[torqueusers] Re: pbsnodes -a do not see the state of all nodes
Hannu Väisänen
hvaisane at joyx.joensuu.fi
Thu Feb 24 01:14:56 MST 2005
On Wed, Feb 23, 2005 at 08:34:42AM -0800, Garrick Staples wrote:
> So mom is rejecting messages from server (momctl doesn't work) and server is
> rejecting messages from mom (message above).
>
> Port filtering or firewalling?
Yes. I put
pbs 15001/tcp
pbs_sched 15004/tcp
pbs_resmon 15003/tcp
pbs_resmon 15003/udp
pbs_mom 15002/tcp
into /etc/services on both machines and enabled ports 1500[1-4]
in firewall and restarted everything that starts with pbs_
Now momctl works but psnodes -a still says
state = state-unknown,down
np = 1
ntype = cluster
momctl -d 4 -h <node-name> says
Host: xxxx Server: xxxx Version: torque_1.2.0p1
HomeDirectory: /usr/spool/PBS/mom_priv
MOM active: 1800 seconds
WARNING: no messages received from server
Server Update Interval: 20 seconds
Server Update Interval: 20 seconds
WARNING: no hello/cluster-addrs messages received from server
Init Msgs Sent: 59 hellos
LOGLEVEL: 0 (use SIGUSR1/SIGUSR2 to adjust)
Communication Model: RPP
TCP Timeout: 20 seconds
Prolog Alarm Time: 300 seconds
Alarm Time: 0 of 10 seconds
Trusted Client List: xxxx
JobList: NONE
diagnostics complete
More information about the torqueusers
mailing list