[torqueusers] init_network failed dis and Unauthorized request

Jan jand at uvic.ca
Tue Feb 12 10:40:08 MST 2008


Hello all,

I ran into some more challenges. After I had Torque running on a single 
machine for a while, I got a second machine and started building a 
little cluster. I followed 
http://debianclusters.cs.uni.edu/index.php/Main_Page which was great.

My head node has two Ethernet interfaces:
eth0 faces the outside world and resolves to homathko
eth1 resolves to homathko1 and is what the other node sees.
I also created a torque.cfg with the line
SERVERHOST             homathko1

OK, so I configure and compile Torque with:
./configure --with-gnu-ld --prefix=/usr/local --with-rcp=scp \
    --with-default-server=homathko1

homathko1 resolves to 192.168.1.200 on eth1
the other name is homathko which resolves to my outside address on eth0
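
To double-check those two mappings, something like the following could be used. It is only a sketch: it assumes the names are defined in a hosts-format file such as /etc/hosts (they might equally come from DNS), and hosts_lookup is a helper name made up for this mail.

```shell
# Print the address that a hosts-format file maps a name to (first match).
# Usage: hosts_lookup NAME [FILE]   (FILE defaults to /etc/hosts)
# Assumption: the cluster names live in a hosts-format file, not only in DNS.
hosts_lookup() {
    awk -v name="$1" '
        { sub(/#.*/, "") }                 # strip trailing comments
        { for (i = 2; i <= NF; i++)        # fields 2..NF are names and aliases
              if ($i == name) { print $1; exit } }
    ' "${2:-/etc/hosts}"
}

# Example calls on the head node:
#   hosts_lookup homathko1
#   hosts_lookup homathko
```

If the setup is as described, the first call should print 192.168.1.200 and the second the outside address.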

Then I try to create the server database:
root@homathko:/usr/local/torque-2.2.1# pbs_server -t create
pbs_server: network: Address already in use
PBS_Server: PBS_Server, init_network failed dis

That sounds strange. But after killing the pbs_server and not changing 
anything else, it starts:

root@homathko:/usr/local/torque-2.2.1# ps -e |grep pbs
  3248 ?        00:00:00 pbs_server
root@homathko:/usr/local/torque-2.2.1# kill -9 3248
root@homathko:/usr/local/torque-2.2.1# ps -e |grep pbs
root@homathko:/usr/local/torque-2.2.1# pbs_server -t create
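
In hindsight, the "Address already in use" message was presumably caused by that old pbs_server still holding the port. A small guard along these lines would have caught it before the create step. This is a sketch only; is_running is a helper name I made up, and it matches on the command name the same way my ps/grep above does.

```shell
# Return success if a process with exactly this command name is running.
is_running() {
    ps -e -o comm= | grep -qx -- "$1"
}

if is_running pbs_server; then
    echo "pbs_server is already running; stop it before 'pbs_server -t create'"
else
    echo "no pbs_server running"
fi
```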

However, if I grep in $PBSHOME for my outside name, it seems as if the 
server starts with my outside name:

root@homathko:/var/spool/torque# grep -r seos * |less
server_logs/20080212:02/12/2008 09:29:30;0006;PBS_Server;Svr;PBS_Server;Server homathko.seos.uvic.ca started, initialization type = 4
server_logs/20080212:02/12/2008 09:29:30;0004;PBS_Server;Svr;homathko.seos.uvic.ca;cannot open node description file '/var/spool/torque/server_priv/nodes' in setup_nodes()
server_logs/20080212:02/12/2008 09:29:30;0004;PBS_Server;Svr;homathko.seos.uvic.ca;[continued]
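
To pull out just the name the server reports at startup, the "Server ... started" lines can be filtered. The following is a sketch based only on the log format shown above; started_as is a made-up helper name.

```shell
# Print the host name from each "Server <name> started" line read on stdin.
started_as() {
    sed -n 's/.*;Server \([^ ]*\) started,.*/\1/p'
}

# Sample line copied from the grep output above.
printf '%s\n' '02/12/2008 09:29:30;0006;PBS_Server;Svr;PBS_Server;Server homathko.seos.uvic.ca started, initialization type = 4' | started_as
# Against the real log it would be:
#   started_as < /var/spool/torque/server_logs/20080212
```

For the sample line this prints homathko.seos.uvic.ca, i.e. the outside name and not homathko1. If I understand the layout correctly, the default server name is also kept in $PBSHOME/server_name, so comparing the two might be informative.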

And I think that is why I get "Unauthorized Request":

root@homathko:/usr/local/torque-2.2.1# qmgr
Max open servers: 4
Qmgr: set server scheduling=true
qmgr obj= svr=default: Unauthorized Request
Qmgr:

Any ideas?

Thank you very much, Jan

-- 
Jan Dettmer, Postdoctoral Fellow
School of Earth and Ocean Sciences, University of Victoria	
Victoria, BC V8W 3P6
office: (250) 472-4342	email: jand at uvic.ca
http://web.uvic.ca/~jand/

