[torqueusers] Help: Unauthorized Request
Gus Correa
gus at ldeo.columbia.edu
Tue Sep 27 12:10:32 MDT 2011
Aaron,
You can set the stack size to unlimited in /etc/security/limits.conf
(shown here along with locked memory and number of open files):
* - memlock -1
* - stack -1
* - nofile 4096
Granted, the above is RHEL/CentOS style;
Debian/Ubuntu may use a different syntax or a different file.
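
To see which limits the current shell actually has, `ulimit` can be queried directly. The sketch below is only an illustration, assuming a bash-like shell; the helper name `show_limits` is made up for this example. limits.conf is applied by PAM when a session starts, so it is best run in a fresh login after editing the file.

```shell
# show_limits is a hypothetical helper name, not part of TORQUE.
# It prints the limits of the current shell for the three resources above.
show_limits() {
    echo "stack=$(ulimit -s)"      # stack size (kB) or "unlimited"
    echo "memlock=$(ulimit -l)"    # locked memory (kB) or "unlimited"
    echo "nofile=$(ulimit -n)"     # maximum number of open file descriptors
}

show_limits
```

For a single run without TORQUE, calling `ulimit -s unlimited` in the same shell just before `mpirun ./ccsm.exe` should have a similar effect on locally launched processes, provided the hard limit allows it.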
Also, you may want to check your /var/log/messages [/var/log/syslog on
Ubuntu] and see if it sheds more light on
the pbs_server errors.
My guess is that you need consistent server names in server_name,
server_priv/nodes [assuming your server is also a compute
node running pbs_mom], and mom_priv/config (for $pbsserver).
My recollection is that these default to 'localhost' [and 127.0.0.1]
if your installation is on a *single standalone machine*,
but I am not sure.
You also need correct name resolution in /etc/hosts, as Mike Reppert
and Jerry Smith pointed out.
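
A rough way to check that consistency on one machine is sketched below. It assumes PBS_HOME is /var/spool/torque (the path that shows up in the server log quoted further down) and a system where `getent` is available; `read_server_name` and `name_resolves` are hypothetical helper names, not TORQUE commands.

```shell
# Assumption: TORQUE's home directory, taken from the log quoted in this thread.
PBS_HOME="${PBS_HOME:-/var/spool/torque}"

# Hypothetical helper: print the contents of a server_name file, whitespace removed.
read_server_name() {
    tr -d '[:space:]' < "$1"
}

# Hypothetical helper: succeed if the name resolves (via /etc/hosts, DNS, ...).
name_resolves() {
    getent hosts "$1" > /dev/null 2>&1
}

if [ -r "$PBS_HOME/server_name" ]; then
    server=$(read_server_name "$PBS_HOME/server_name")
    echo "server_name file says: $server"
    echo "this machine's hostname: $(hostname)"
    if name_resolves "$server"; then
        echo "$server resolves"
    else
        echo "$server does NOT resolve; check /etc/hosts"
    fi
    # These files are often readable by root only, hence the fallbacks.
    cat "$PBS_HOME/server_priv/nodes" 2>/dev/null || echo "cannot read server_priv/nodes"
    grep '^\$pbsserver' "$PBS_HOME/mom_priv/config" 2>/dev/null \
        || echo "no readable \$pbsserver line in mom_priv/config"
fi
```

The names printed for server_name, the nodes file, and $pbsserver should all agree, and that name should resolve.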
Also, unrelated to this, you will need to enable scheduling [after the
current problem is sorted out]:
qmgr -c 'set server scheduling = True'
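
To confirm afterwards that the setting took effect, the output of `qmgr -c 'print server'` can be searched for the scheduling line. A small sketch: `scheduling_enabled` is a made-up helper that reads that output on stdin, and the check is skipped if qmgr is not on the PATH.

```shell
# Hypothetical helper: reads `qmgr -c 'print server'` output on stdin and
# succeeds if it contains the line that enables scheduling.
scheduling_enabled() {
    grep -q '^set server scheduling = True'
}

if command -v qmgr > /dev/null 2>&1; then
    if qmgr -c 'print server' | scheduling_enabled; then
        echo "scheduling is enabled"
    else
        echo "scheduling is not enabled (or qmgr was refused)"
    fi
fi
```

As far as I know, the flag only tells pbs_server to contact a scheduler; a scheduler such as pbs_sched still has to be running for jobs to start.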
Out of curiosity, is it a single machine or a small cluster?
I hope this helps,
Gus Correa
Aaron T Perry wrote:
> Yes, that I do have; that was the first thing I came across when
> looking through help online.
>
> And I added the nodes file with appropriate settings for my machine, but
> I still get the same errors.
>
> I have a completely unrelated question. I'm doing all this to run a
> model that I've been trying to port. I'm trying to figure out whether a
> segmentation fault I'm getting at runtime (using mpirun ./ccsm.exe) is
> due to a compiler error or a stack/memory error (the code works on many
> other machines, though not necessarily with the compiler I'm using). If I can
> install torque I can use an automated script that also
> sets appropriate stack size, among other things. I am on 1 computer,
> with 1 node, and I have no desire to scale this instance of the model.
> Basically I'm wondering if you think there might be an easier/better
> alternative?
>
> Thank you,
> Aaron
>
>
> On Tue, Sep 27, 2011 at 1:04 PM, Smith, Jerry Don II <jdsmit at sandia.gov> wrote:
>
> $PBS_HOME/server_priv/nodes needs to encompass your compute nodes
>
> node1 np=4 # or however many cores you have
> node2 np=4
>
> Make sure that those nodes can be resolved via those names from the
> admin node.
>
> Do you have $PBS_HOME/server_name file with the resolvable name of
> your admin server?
>
> -Jerry
>
> From: Aaron T Perry <atp42 at cornell.edu>
> Reply-To: Torque Users Mailing List <torqueusers at supercluster.org>
> Date: Tue, 27 Sep 2011 12:58:50 -0400
> To: Torque Users Mailing List <torqueusers at supercluster.org>
> Subject: Re: [torqueusers] Help: Unauthorized Request
>
> I think I have, I needed to create the file, and I was unsure about
> the formatting required.
> This is what I have there.
>
> # + + ubuntu atp42
>
> Do I also need to create the nodes file in the torque/server_priv
> directory?
>
> Thanks,
> Aaron
>
> On Tue, Sep 27, 2011 at 12:40 PM, Smith, Jerry Don II
> <jdsmit at sandia.gov> wrote:
>
> Have you set up hosts.equiv?
>
> see: http://www.clusterresources.com/torquedocs/1.3advconfig.shtml
>
>
> 1.3.2.1 Server Configuration Overview
>
> There are several steps to ensure that the server and the nodes
> are completely aware of each other and able to communicate
> directly. Some of this configuration takes place within TORQUE
> directly using the *qmgr* command. Other configuration settings
> are managed using the *pbs_server* nodes file, DNS files such as
> /etc/hosts and the /etc/hosts.equiv file.
>
>
> 1.3.2.2 Name Service Configuration
>
> Each node, as well as the server, must be able to resolve the
> name of every node with which it will interact. This can be
> accomplished using /etc/hosts, *DNS*, *NIS*, or other
> mechanisms. In the case of /etc/hosts, the file can be shared
> across systems in most cases.
>
>
> -Jerry
>
>
> From: Aaron T Perry <atp42 at cornell.edu>
> Reply-To: Torque Users Mailing List <torqueusers at supercluster.org>
> Date: Tue, 27 Sep 2011 12:33:31 -0400
> To: Torque Users Mailing List <torqueusers at supercluster.org>
> Subject: Re: [torqueusers] Help: Unauthorized Request
>
> With the exception of the unauthorized request entries it looks
> like almost everything is okay, except for the node file and
> root <at> localhost (this should be root <at> ubuntu).
>
> Thank you for your help!
> Aaron
>
> Here is an excerpt from the server log...
>
> 09/27/2011 09:51:31;0002;PBS_Server;Svr;Log;Log opened
> 09/27/2011 09:51:31;0006;PBS_Server;Svr;PBS_Server;Server ubuntu
> started, initialization type = 4
> 09/27/2011 09:51:42;0002;PBS_Server;Svr;Log;Log opened
> 09/27/2011 09:51:42;0006;PBS_Server;Svr;PBS_Server;Server ubuntu
> started, initialization type = 4
> 09/27/2011 09:51:44;0002;PBS_Server;Svr;Act;Account file
> /var/spool/torque/server_priv/accounting/20110927 opened
> 09/27/2011 09:51:44;0040;PBS_Server;Req;setup_nodes;setup_nodes()
> 09/27/2011 09:51:44;0004;PBS_Server;Svr;ubuntu;cannot open node
> description file '/var/spool/torque/server_priv/nodes' in
> setup_nodes()
> 09/27/2011 09:51:44;0002;PBS_Server;Svr;PBS_Server;Expected 0,
> recovered 0 queues
> 09/27/2011 09:51:44;0002;PBS_Server;Svr;PBS_Server;Expected 0,
> recovered 0 jobs
> 09/27/2011 09:51:44;0006;PBS_Server;Svr;PBS_Server;Using ports
> Server:15001 Scheduler:15004 MOM:15002 (server: 'ubuntu')
> 09/27/2011 09:51:44;0002;PBS_Server;Svr;daemonize_server;INFO:
> parent is exiting
> 09/27/2011 09:51:44;0002;PBS_Server;Svr;daemonize_server;INFO:
> parent is exiting
> 09/27/2011 09:51:44;0002;PBS_Server;Svr;daemonize_server;INFO:
> child process in background
> 09/27/2011 09:51:44;0002;PBS_Server;Svr;PBS_Server;Server Ready,
> pid = 11995, loglevel=0
> 09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject reply
> code=15007(Unauthorized Request ), aux=0, type=Manager, from
> root at localhost
> 09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject reply
> code=15007(Unauthorized Request ), aux=0, type=Manager, from
> root at localhost
> 09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject reply
> code=15007(Unauthorized Request ), aux=0, type=Manager, from
> root at localhost
> 09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject reply
> code=15007(Unauthorized Request ), aux=0, type=Manager, from
> root at localhost
> 09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject reply
> code=15007(Unauthorized Request ), aux=0, type=Manager, from
> root at localhost
> 09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject reply
> code=15007(Unauthorized Request ), aux=0, type=Manager, from
> root at localhost
> 09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject reply
> code=15007(Unauthorized Request ), aux=0, type=Manager, from
> root at localhost
> 09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject reply
> code=15007(Unauthorized Request ), aux=0, type=Manager, from
> root at localhost
> 09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject reply
> code=15007(Unauthorized Request ), aux=0, type=Manager, from
> root at localhost
> 09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject reply
> code=15007(Unauthorized Request ), aux=0, type=Manager, from
> root at localhost
> 09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject reply
> code=15007(Unauthorized Request ), aux=0, type=Manager, from
> root at localhost
> 09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject reply
> code=15007(Unauthorized Request ), aux=0, type=Manager, from
> root at localhost
> 09/27/2011 09:51:49;0002;PBS_Server;Svr;PBS_Server;Torque Server
> Version = 3.0.2, loglevel = 0
> 09/27/2011 09:56:49;0002;PBS_Server;Svr;PBS_Server;Torque Server
> Version = 3.0.2, loglevel = 0
> ...
>
> On Tue, Sep 27, 2011 at 12:13 PM, Smith, Jerry Don II
> <jdsmit at sandia.gov> wrote:
>
> Are you seeing anything in the pbs_server logs?
>
> -Jerry
>
> From: Aaron T Perry <atp42 at cornell.edu>
> Reply-To: Torque Users Mailing List <torqueusers at supercluster.org>
> Date: Tue, 27 Sep 2011 09:53:28 -0400
> To: <torqueusers at supercluster.org>
> Subject: Re: [torqueusers] Help: Unauthorized Request
>
> Please, any help you can give would be greatly appreciated,
> I'm completely stuck. All the solutions I found online have
> failed.
>
> On Mon, Sep 26, 2011 at 2:35 PM, Aaron <atp42 at cornell.edu> wrote:
>
> Hi,
>
> I've just tried to install torque, and I ran the
> following commands,
>
> ./configure
> sudo make
> sudo make install
>
> however when I run ./torque.setup username I get the
> following...
>
> initializing TORQUE (admin: username at ubuntu)
> PBS_Server ubuntu: Create mode and server database exists,
> do you wish to continue y/(n)?y
> Max open servers: 9
> qmgr obj= svr=default: Unauthorized Request
> Max open servers: 9
> qmgr obj= svr=default: Unauthorized Request
> qmgr obj= svr=default: Unauthorized Request
> qmgr obj= svr=default: Unauthorized Request
> qmgr obj= svr=default: Unauthorized Request
> qmgr obj=batch svr=default: Unauthorized Request
> qmgr obj=batch svr=default: Unauthorized Request
> qmgr obj=batch svr=default: Unauthorized Request
> qmgr obj=batch svr=default: Unauthorized Request
> qmgr obj=batch svr=default: Unauthorized Request
> qmgr obj=batch svr=default: Unauthorized Request
> qmgr obj= svr=default: Unauthorized Request
>
> The server launched and I cannot stop it, nor can I issue
> any command related to torque (qterm, qmgr, qsub, etc.)
> under my current username or under root. Help!
>
> Thank you,
> -Aaron
>
>
> _______________________________________________
> torqueusers mailing list
> torqueusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/torqueusers