[torqueusers] init_network failed dis and Unauthorized request
Garrick Staples
garrick at usc.edu
Tue Feb 12 11:56:22 MST 2008
On Tue, Feb 12, 2008 at 09:40:08AM -0800, Jan alleged:
> Hello all,
>
> I ran into some more challenges. After I had Torque running on a single
> machine for a while, I got a second machine and started building a
> little cluster. I followed
> http://debianclusters.cs.uni.edu/index.php/Main_Page which was great.
>
> My headnode has two ethernet interfaces:
> eth0 is the outside world and resolves to homathko
> eth1 resolves to homathko1 and is what the other node sees.
> I also created a tourque.cfg with the line
> SERVERHOST homathko1
Unless you are using an old version, pbs_server doesn't read torque.cfg. Use
the 'server_name' server attribute in qmgr.
> OK, so I compile torque with:
> ./configure --with-gnu-ld --prefix=/usr/local --with-rcp=scp
> --with-default-server=homathko1
--with-gnu-ld? Which OS is this?
> homathko1 resolves to 192.168.1.200 on eth1
> the other name is homathko which resolves to my outside address on eth0
>
> The I try to create the server database:
> root at homathko:/usr/local/torque-2.2.1#pbs_server -t create
> pbs_server: network: Address already in use
> PBS_Server: PBS_Server, init_network failed dis
>
> That sounds strange. But after killing the pbs_server and not changing
> anything else, it starts:
That's not strange. Only one daemon can run at a time.
> root at homathko:/usr/local/torque-2.2.1# ps -e |grep pbs
> 3248 ? 00:00:00 pbs_server
> root at homathko:/usr/local/torque-2.2.1# kill -9 3248
> root at homathko:/usr/local/torque-2.2.1# ps -e |grep pbs
> root at homathko:/usr/local/torque-2.2.1# pbs_server -t create
Perfectly normal. You have to kill the running daemon before you can run a new
daemon.
> However, if I grep in $PBSHOME for my outside name, it seems as if the
> server starts with my outside name:
>
> root at homathko:/var/spool/torque# grep -r seos * |less
> server_logs/20080212:02/12/2008
> 09:29:30;0006;PBS_Server;Svr;PBS_Server;Server homathko.seos.uvic.ca
> started, initialization type = 4
> server_logs/20080212:02/12/2008
> 09:29:30;0004;PBS_Server;Svr;homathko.seos.uvic.ca;cannot open node
> description file '/var/spool/torque/server_priv/nodes' in setup_nodes()
> server_logs/20080212:02/12/2008
> 09:29:30;0004;PBS_Server;Svr;homathko.seos.uvic.ca;[continued]
>
> And I think that is why I get "Unauthorized Request":
>
> root at homathko:/usr/local/torque-2.2.1# qmgr
> Max open servers: 4
> Qmgr: set server scheduling=true
> qmgr obj= svr=default: Unauthorized Request
> Qmgr:
'qmgr homathko' and set server_name.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
Url : http://www.supercluster.org/pipermail/torqueusers/attachments/20080212/3b12a989/attachment.bin
More information about the torqueusers
mailing list