[torqueusers] communication problems between pbs_mom and pbs_server

Daniel Burbano daniel.burbano at gmail.com
Wed Jan 18 16:08:02 MST 2012


Hello,

My name is Daniel Burbano.  I am installing a cluster with PBS.  I
have problem communications when the pbs_mom try to find the
pbs_server (timeout).  These are the logs:

01/17/2012 17:59:39;0002;   pbs_mom;Svr;Log;Log opened
01/17/2012 17:59:39;0002;   pbs_mom;Svr;pbs_mom;Torque Mom Version = 3.0.3
, loglevel = 0
01/17/2012 17:59:39;0002;   pbs_mom;Svr;setpbsserver;bio01
01/17/2012 17:59:39;0002;   pbs_mom;Svr;mom_server_add;server bio01 added
01/17/2012 17:59:39;0002;   pbs_mom;n/a;initialize;independent
01/17/2012 17:59:39;0080;   pbs_mom;Svr;pbs_mom;before init_abort_jobs
01/17/2012 17:59:39;0002;   pbs_mom;Svr;pbs_mom;Is up
01/17/2012 17:59:39;0002;   pbs_mom;Svr;setup_program_environment;MOM exec
utable path and mtime at launch: /opt/torque/sbin/pbs_mom 1325906867
01/17/2012 17:59:39;0002;   pbs_mom;Svr;pbs_mom;Torque Mom Version = 3.0.3
, loglevel = 0
01/17/2012 17:59:39;0002;   pbs_mom;n/a;mom_server_check_connection;sendin
g hello to server bio01
01/17/2012 18:01:09;0002;   pbs_mom;n/a;mom_server_check_connection;connec
tion to server bio01 timeout
[root at bio03 mom_logs]# tail -100 20120117
01/17/2012 18:10:10;0002;
pbs_mom;n/a;mom_server_check_connection;sending hello to server bio01
01/17/2012 18:11:40;0002;
pbs_mom;n/a;mom_server_check_connection;connection to server bio01
timeout
01/17/2012 18:11:40;0002;
pbs_mom;n/a;mom_server_check_connection;sending hello to server bio01
01/17/2012 18:13:10;0002;
pbs_mom;n/a;mom_server_check_connection;connection to server bio01
timeout


In the other hand, I don´t have problems when the pbs_mom and the
pbs_server are located in the same machine.

The firewall are disabled in the machines.
The machines are in the same network.
The selinux is disabled in the machines.
The ssh without password is configured correctly.
The servers are virtual machine created in AWS.

Any idea?


Thanks


-- 
Daniel Burbano, MCpE


More information about the torqueusers mailing list