[torqueusers] Help: Unauthorized Request
Gus Correa
gus at ldeo.columbia.edu
Tue Sep 27 13:01:38 MDT 2011
Now I wonder if part of the problem is due to it being
a virtual machine.
- Does torque work in a virtual environment?
- How does MPI {whatever MPI you're using] behave
[works?, performs well?] in a virtual environment?
- Does something as big as ccsm [your ultimate goal apparently]
work in a virtual environment?
Honestly, I don't really know.
For what it is worth, we run ccsm/cesm in a Linux cluster with
Torque, OpenMPI, etc.
No virtualization, though.
Gus Correa
Aaron T Perry wrote:
> This is a single machine, it's a virtual machine running on my Windows 7
> desktop. Thanks, I'm trying your suggestion now.
>
> Thanks,
> Aaron
>
> On Tue, Sep 27, 2011 at 2:10 PM, Gus Correa <gus at ldeo.columbia.edu
> <mailto:gus at ldeo.columbia.edu>> wrote:
>
> Aron
>
> You can set the stack size unlimited in /etc/security/limits.conf
> (here along with locked memory and number of open files):
>
> * - memlock -1
> * - stack -1
> * - nofile 4096
>
> Granted that the above is RHEL/CentOS style,
> Debian/Ubuntu may be different/different file.
>
> Also, you may want to check your /var/log/messages [or whatever Ubuntu
> uses for system logs] and see if it sheds more light into
> the pbs_server errors.
>
> My guess is that you need consistent server names in server_name,
> server_priv/nodes [assuming your server is also a work
> node running pbs_mom], mom_priv/config (for $pbsserver).
> My recollection is that these default to 'localhost' [and 127.0.0.1],
> if your installation is in a *single standalone machine*,
> but I am not sure.
> And you need right name resolution in /etc/hosts, as Mike Reppert
> and Jerry Smith pointed out.
>
> Also, not related, but you need to enable scheduling [after the
> current problem is sorted out]:
>
> qmgr -c 'set server scheduling = True'
>
> Out of curiosity, is it a single machine or a small cluster?
>
> I hope this helps,
> Gus Correa
>
> Aaron T Perry wrote:
> > Yes, that I do i have, that was the first thing I came across when
> > looking through help online.
> >
> > And I added the nodes file with appropriate settings for my
> machine, but
> > I still get the same errors.
> >
> > I have a completely unrelated question. I'm doing all this to run a
> > model that I've been trying to port. I'm trying to figure out
> whether a
> > segmentation fault I'm getting at runtime (using mpirun
> ./ccsm.exe) is
> > due to a compiler error, or a stack/memory error (the code works
> on many
> > other machines, not necessarily the compiler I'm using though).
> If I can
> > install torque I can use an automated script that also
> > sets appropriate stack size, among other things. I am on 1 computer,
> > with 1 node, and I have no desire to scale this instance of the
> model.
> > Basically I'm wondering if you think there might be an easier/better
> > alternative?
> >
> > Thank you,
> > Aaron
> >
> >
> > On Tue, Sep 27, 2011 at 1:04 PM, Smith, Jerry Don II
> <jdsmit at sandia.gov <mailto:jdsmit at sandia.gov>
> > <mailto:jdsmit at sandia.gov <mailto:jdsmit at sandia.gov>>> wrote:
> >
> > $PBS_HOME/server_priv/nodes needs to encompass your compute nodes
> >
> > node1 np=4 # or however many cores you have
> > node2 np=4
> >
> > Make sure that those nodes can be resolved via those names
> from the
> > admin node.
> >
> > Do you have $PBS_HOME/server_name file with the resolvable
> name of
> > your admin server?
> >
> > -Jerry
> >
> > From: Aaron T Perry <atp42 at cornell.edu
> <mailto:atp42 at cornell.edu> <mailto:atp42 at cornell.edu
> <mailto:atp42 at cornell.edu>>>
> > Reply-To: Torque Users Mailing List
> <torqueusers at supercluster.org <mailto:torqueusers at supercluster.org>
> > <mailto:torqueusers at supercluster.org
> <mailto:torqueusers at supercluster.org>>>
> > Date: Tue, 27 Sep 2011 12:58:50 -0400
> >
> > To: Torque Users Mailing List <torqueusers at supercluster.org
> <mailto:torqueusers at supercluster.org>
> > <mailto:torqueusers at supercluster.org
> <mailto:torqueusers at supercluster.org>>>
> > Subject: Re: [torqueusers] Help: Unauthorized Request
> >
> > I think I have, I needed to create the file, and I was unsure
> about
> > the formatting required.
> > This is what I have there.
> >
> > # + + ubuntu atp42
> >
> > Do I also need to create the nodes file in the torque>server_priv
> > directory?
> >
> > Thanks,
> > Aaron
> >
> > On Tue, Sep 27, 2011 at 12:40 PM, Smith, Jerry Don II
> > <jdsmit at sandia.gov <mailto:jdsmit at sandia.gov>
> <mailto:jdsmit at sandia.gov <mailto:jdsmit at sandia.gov>>> wrote:
> >
> > Have you set up hosts.equiv?
> >
> > see:
> http://www.clusterresources.com/torquedocs/1.3advconfig.shtml
> >
> >
> > 1.3.2.1 Server Configuration Overview
> >
> > There are several steps to ensure that the server and the
> nodes
> > are completely aware of each other and able to communicate
> > directly. Some of this configuration takes place within
> TORQUE
> > directly using the *qmgr* command. Other configuration
> settings
> > are managed using the *pbs_server*nodes file, DNS files
> such as
> > /etc/hosts and the /etc/hosts.equiv file.
> >
> >
> > 1.3.2.2 Name Service Configuration
> >
> > Each node, as well as the server, must be able to resolve the
> > name of every node with which it will interact. This can be
> > accomplished using /etc/hosts, *DNS*, *NIS*, or other
> > mechanisms. In the case of /etc/hosts, the file can be shared
> > across systems in most cases.
> >
> >
> > -Jerry
> >
> >
> > From: Aaron T Perry <atp42 at cornell.edu
> <mailto:atp42 at cornell.edu> <mailto:atp42 at cornell.edu
> <mailto:atp42 at cornell.edu>>>
> > Reply-To: Torque Users Mailing List
> > <torqueusers at supercluster.org
> <mailto:torqueusers at supercluster.org>
> <mailto:torqueusers at supercluster.org
> <mailto:torqueusers at supercluster.org>>>
> > Date: Tue, 27 Sep 2011 12:33:31 -0400
> >
> > To: Torque Users Mailing List
> <torqueusers at supercluster.org <mailto:torqueusers at supercluster.org>
> > <mailto:torqueusers at supercluster.org
> <mailto:torqueusers at supercluster.org>>>
> > Subject: Re: [torqueusers] Help: Unauthorized Request
> >
> > With the execption of the unauthorized request entries it
> looks
> > like almost everything is okay, execpt for the node file and
> > root <at> localhost (this should be root <at> ubuntu.
> >
> > Thank you for your help!
> > Aaron
> >
> > Here is an except from the server log...
> >
> > 09/27/2011 09:51:31;0002;PBS_Server;Svr;Log;Log opened
> > 09/27/2011 09:51:31;0006;PBS_Server;Svr;PBS_Server;Server
> ubuntu
> > started, initialization type = 4
> > 09/27/2011 09:51:42;0002;PBS_Server;Svr;Log;Log opened
> > 09/27/2011 09:51:42;0006;PBS_Server;Svr;PBS_Server;Server
> ubuntu
> > started, initialization type = 4
> > 09/27/2011 09:51:44;0002;PBS_Server;Svr;Act;Account file
> > /var/spool/torque/server_priv/accounting/20110927 opened
> > 09/27/2011
> 09:51:44;0040;PBS_Server;Req;setup_nodes;setup_nodes()
> > 09/27/2011 09:51:44;0004;PBS_Server;Svr;ubuntu;cannot
> open node
> > description file '/var/spool/torque/server_priv/nodes' in
> > setup_nodes()
> > 09/27/2011
> 09:51:44;0002;PBS_Server;Svr;PBS_Server;Expected 0,
> > recovered 0 queues
> > 09/27/2011
> 09:51:44;0002;PBS_Server;Svr;PBS_Server;Expected 0,
> > recovered 0 jobs
> > 09/27/2011 09:51:44;0006;PBS_Server;Svr;PBS_Server;Using
> ports
> > Server:15001 Scheduler:15004 MOM:15002 (server: 'ubuntu')
> > 09/27/2011
> 09:51:44;0002;PBS_Server;Svr;daemonize_server;INFO:
> > parent is exiting
> > 09/27/2011
> 09:51:44;0002;PBS_Server;Svr;daemonize_server;INFO:
> > parent is exiting
> > 09/27/2011
> 09:51:44;0002;PBS_Server;Svr;daemonize_server;INFO:
> > child process in background
> > 09/27/2011 09:51:44;0002;PBS_Server;Svr;PBS_Server;Server
> Ready,
> > pid = 11995, loglevel=0
> > 09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject
> reply
> > code=15007(Unauthorized Request ), aux=0, type=Manager, from
> > root at localhost
> > 09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject
> reply
> > code=15007(Unauthorized Request ), aux=0, type=Manager, from
> > root at localhost
> > 09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject
> reply
> > code=15007(Unauthorized Request ), aux=0, type=Manager, from
> > root at localhost
> > 09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject
> reply
> > code=15007(Unauthorized Request ), aux=0, type=Manager, from
> > root at localhost
> > 09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject
> reply
> > code=15007(Unauthorized Request ), aux=0, type=Manager, from
> > root at localhost
> > 09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject
> reply
> > code=15007(Unauthorized Request ), aux=0, type=Manager, from
> > root at localhost
> > 09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject
> reply
> > code=15007(Unauthorized Request ), aux=0, type=Manager, from
> > root at localhost
> > 09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject
> reply
> > code=15007(Unauthorized Request ), aux=0, type=Manager, from
> > root at localhost
> > 09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject
> reply
> > code=15007(Unauthorized Request ), aux=0, type=Manager, from
> > root at localhost
> > 09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject
> reply
> > code=15007(Unauthorized Request ), aux=0, type=Manager, from
> > root at localhost
> > 09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject
> reply
> > code=15007(Unauthorized Request ), aux=0, type=Manager, from
> > root at localhost
> > 09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject
> reply
> > code=15007(Unauthorized Request ), aux=0, type=Manager, from
> > root at localhost
> > 09/27/2011 09:51:49;0002;PBS_Server;Svr;PBS_Server;Torque
> Server
> > Version = 3.0.2, loglevel = 0
> > 09/27/2011 09:56:49;0002;PBS_Server;Svr;PBS_Server;Torque
> Server
> > Version = 3.0.2, loglevel = 0
> > ...
> >
> > On Tue, Sep 27, 2011 at 12:13 PM, Smith, Jerry Don II
> > <jdsmit at sandia.gov <mailto:jdsmit at sandia.gov>
> <mailto:jdsmit at sandia.gov <mailto:jdsmit at sandia.gov>>> wrote:
> >
> > Are you seeing anything in the pbs_server logs?
> >
> > -Jerry
> >
> > From: Aaron T Perry <atp42 at cornell.edu
> <mailto:atp42 at cornell.edu>
> > <mailto:atp42 at cornell.edu <mailto:atp42 at cornell.edu>>>
> > Reply-To: Torque Users Mailing List
> > <torqueusers at supercluster.org
> <mailto:torqueusers at supercluster.org>
> > <mailto:torqueusers at supercluster.org
> <mailto:torqueusers at supercluster.org>>>
> > Date: Tue, 27 Sep 2011 09:53:28 -0400
> > To: <torqueusers at supercluster.org
> <mailto:torqueusers at supercluster.org>
> > <mailto:torqueusers at supercluster.org
> <mailto:torqueusers at supercluster.org>>>
> > Subject: Re: [torqueusers] Help: Unauthorized Request
> >
> > Please, any help you can give would be greatly
> appreciated,
> > I'm completely stuck. All the solutions I found
> online have
> > failed.
> >
> > On Mon, Sep 26, 2011 at 2:35 PM, Aaron
> <atp42 at cornell.edu <mailto:atp42 at cornell.edu>
> > <mailto:atp42 at cornell.edu
> <mailto:atp42 at cornell.edu>>> wrote:
> >
> > Hi,
> >
> > I've just tried to install torque, and I ran the
> > following commands,
> >
> > ./configure
> > sudo make
> > sudo make install
> >
> > however when I run ./torque.setup username I get the
> > following...
> >
> > initializing TORQUE (admin: username at ubuntu)
> > PBS_Server ubuntu: Create mode and server
> database exists,
> > do you wish to continue y/(n)?y
> > Max open servers: 9
> > qmgr obj= svr=default: Unauthorized Request
> > Max open servers: 9
> > qmgr obj= svr=default: Unauthorized Request
> > qmgr obj= svr=default: Unauthorized Request
> > qmgr obj= svr=default: Unauthorized Request
> > qmgr obj= svr=default: Unauthorized Request
> > qmgr obj=batch svr=default: Unauthorized Request
> > qmgr obj=batch svr=default: Unauthorized Request
> > qmgr obj=batch svr=default: Unauthorized Request
> > qmgr obj=batch svr=default: Unauthorized Request
> > qmgr obj=batch svr=default: Unauthorized Request
> > qmgr obj=batch svr=default: Unauthorized Request
> > qmgr obj= svr=default: Unauthorized Request
> >
> > The server lanched and I cannot stop it, nor can
> issue
> > any command related to torque (qterm, gmgr, qsub,
> etc)
> > under my current username or under root. Help!
> >
> > Thank you,
> > -Aaron
> >
> >
> > _______________________________________________
> torqueusers
> > mailing list torqueusers at supercluster.org
> <mailto:torqueusers at supercluster.org>
> > <mailto:torqueusers at supercluster.org
> <mailto:torqueusers at supercluster.org>>
> > http://www.supercluster.org/mailman/listinfo/torqueusers
> >
> > _______________________________________________
> > torqueusers mailing list
> > torqueusers at supercluster.org
> <mailto:torqueusers at supercluster.org>
> > <mailto:torqueusers at supercluster.org
> <mailto:torqueusers at supercluster.org>>
> > http://www.supercluster.org/mailman/listinfo/torqueusers
> >
> >
> > _______________________________________________ torqueusers
> > mailing list torqueusers at supercluster.org
> <mailto:torqueusers at supercluster.org>
> > <mailto:torqueusers at supercluster.org
> <mailto:torqueusers at supercluster.org>>
> > http://www.supercluster.org/mailman/listinfo/torqueusers
> >
> > _______________________________________________
> > torqueusers mailing list
> > torqueusers at supercluster.org
> <mailto:torqueusers at supercluster.org>
> <mailto:torqueusers at supercluster.org
> <mailto:torqueusers at supercluster.org>>
> > http://www.supercluster.org/mailman/listinfo/torqueusers
> >
> >
> > _______________________________________________ torqueusers
> mailing
> > list torqueusers at supercluster.org
> <mailto:torqueusers at supercluster.org>
> > <mailto:torqueusers at supercluster.org
> <mailto:torqueusers at supercluster.org>>
> > http://www.supercluster.org/mailman/listinfo/torqueusers
> >
> > _______________________________________________
> > torqueusers mailing list
> > torqueusers at supercluster.org
> <mailto:torqueusers at supercluster.org>
> <mailto:torqueusers at supercluster.org
> <mailto:torqueusers at supercluster.org>>
> > http://www.supercluster.org/mailman/listinfo/torqueusers
> >
> >
> >
> >
> ------------------------------------------------------------------------
> >
> > _______________________________________________
> > torqueusers mailing list
> > torqueusers at supercluster.org <mailto:torqueusers at supercluster.org>
> > http://www.supercluster.org/mailman/listinfo/torqueusers
>
> _______________________________________________
> torqueusers mailing list
> torqueusers at supercluster.org <mailto:torqueusers at supercluster.org>
> http://www.supercluster.org/mailman/listinfo/torqueusers
>
>
>
> ------------------------------------------------------------------------
>
> _______________________________________________
> torqueusers mailing list
> torqueusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/torqueusers
More information about the torqueusers
mailing list