[torqueusers] Help: Unauthorized Request

Coyle, James J [ITACD] jjc at iastate.edu
Tue Sep 27 13:14:43 MDT 2011


  If you continue to have problems, I woud  check if this is really a problem with the Virtual environment.

  I would suggest creating a liveUSB stick with a linux distribution on it.

I've used LinuxLive USB Creator:  http://www.linuxliveusb.com/

to create a bootable USB thumbdrive.

I chose 8GB, and did not have to be picky about the number of packages I use.

The installer runs on Windows, and will create a Linux distribution (I use fedora) on a new thumb drive.
It has a GUI interface, and you don't need special knowledge other than knowing about Unix.
(The persistent image is the portiona that you can use to make updates to your install, I made that 2GB
on my 8GB stick.)

  Pick as many packages as you think that you need, and they will be downloaded and installed.  If you
need something later, you can use yum to install it.

E.g.


  yum install gcc-gfortran boost boost-devel java*


  If your computer's boot order is set to boot from a removable drive before a far disk,
you can just reboot the computer and it should boot from USB.  If this is not set,
you can interrupt the boot processes when it says something like BOOT ORDER F12
by pressing the F12 key, and then selecting the USB drive.

  You can run off the USB drive and it will not affect you machine at all, just shutdown,
pull the USB stick, and power up and your back in Windows again.

  This could check whether the problem is with gcc or with the Virtual machine (did you make the virtual machine with enough memory?)


James Coyle, PhD
High Performance Computing Group
 Iowa State Univ.
web: http://jjc.public.iastate.edu/<http://www.public.iastate.edu/~jjc>



From: torqueusers-bounces at supercluster.org [mailto:torqueusers-bounces at supercluster.org] On Behalf Of Aaron T Perry
Sent: Tuesday, September 27, 2011 1:55 PM
To: Torque Users Mailing List
Subject: Re: [torqueusers] Help: Unauthorized Request

Thank you, I'll try these suggestions. I'm relatively new at this and sometimes feel I'm in over my head.

I'm almost certain this is a compiler or stack limit error. I didn't write the code, and it's known to work on a variety of systems but only commercial compilers are officially supported (i'm using gcc).

Thank you again,
Aaron

On Tue, Sep 27, 2011 at 2:45 PM, Coyle, James J [ITACD] <jjc at iastate.edu<mailto:jjc at iastate.edu>> wrote:
For just one computer, which write the following script files (assumes you have 256GB of memory,
modify as needed.)

scr0:
#!/bin/bash

for j in 1G 2G 4G 8G 16G 32G 64G 128G 256G ; do
echo "Try $j "
./scr1  $j
done
exit



scr1:
#!/bin/csh -f

setenv F t1.$$
/bin/rm -f $F
hostname >  $F
limit stacksize $1
mpirun -n 4 --machinefile $F ./ccsm.exe
/bin/rm -f $F
exit


make both executable with
chmod u+x scr0 scr1


and then issue

  ./scr0


Modify the above procedure as needed.

  If this is not just caused by a stack limit error, I'd look at either a compiler optimization bug (recompile run with -O0
and run) or more likely a programming error (we all make them.)

  I'd recompile and check for bounds (e.g. -C on most Fortran compilers), and uninitialized variables (-uvar on
PathScale or Open64 compilers.  -rabc also works well on Cray Compilers.

  You can also use a parallel debugger like Totalview or DDT, or you can use a run-time error detection tool
like MPI-CHECK (Fortran only) or Marmot.  (See http://rted.public.iastate.edu/MPI/RESULTS/result_table.html
for the kinds of errors that these can catch)  See http://rted.public.iastate.edu/Serial/RESULTS/result_table.html
for program errors other than those involving MPI routines.  If you click on items under the
OS/Compiler/Runtime tool column, you can see the suggested compiler options for best debugging
for that Compiler or tool.





From: torqueusers-bounces at supercluster.org<mailto:torqueusers-bounces at supercluster.org> [mailto:torqueusers-bounces at supercluster.org<mailto:torqueusers-bounces at supercluster.org>] On Behalf Of Aaron T Perry
Sent: Tuesday, September 27, 2011 12:23 PM

To: Torque Users Mailing List
Subject: Re: [torqueusers] Help: Unauthorized Request

Yes, that I do i have, that was the first thing I came across when looking through help online.

And I added the nodes file with appropriate settings for my machine, but I still get the same errors.

I have a completely unrelated question. I'm doing all this to run a model that I've been trying to port. I'm trying to figure out whether a segmentation fault I'm getting at runtime (using mpirun ./ccsm.exe) is due to a compiler error, or a stack/memory error (the code works on many other machines, not necessarily the compiler I'm using though). If I can install torque I can use an automated script that also sets appropriate stack size, among other things. I am on 1 computer, with 1 node, and I have no desire to scale this instance of the model. Basically I'm wondering if you think there might be an easier/better alternative?

Thank you,
Aaron

On Tue, Sep 27, 2011 at 1:04 PM, Smith, Jerry Don II <jdsmit at sandia.gov<mailto:jdsmit at sandia.gov>> wrote:
$PBS_HOME/server_priv/nodes needs to encompass your compute nodes

node1 np=4 # or however many cores you have
node2 np=4

Make sure that those nodes can be resolved via those names from the admin node.

Do you have $PBS_HOME/server_name file with the resolvable name of your admin server?

-Jerry

From: Aaron T Perry <atp42 at cornell.edu<mailto:atp42 at cornell.edu>>
Reply-To: Torque Users Mailing List <torqueusers at supercluster.org<mailto:torqueusers at supercluster.org>>
Date: Tue, 27 Sep 2011 12:58:50 -0400

To: Torque Users Mailing List <torqueusers at supercluster.org<mailto:torqueusers at supercluster.org>>
Subject: Re: [torqueusers] Help: Unauthorized Request

I think I have, I needed to create the file, and I was unsure about the formatting required.
This is what I have there.

# + + ubuntu atp42

Do I also need to create the nodes file in the torque>server_priv directory?

Thanks,
Aaron

On Tue, Sep 27, 2011 at 12:40 PM, Smith, Jerry Don II <jdsmit at sandia.gov<mailto:jdsmit at sandia.gov>> wrote:
Have you set up hosts.equiv?

see: http://www.clusterresources.com/torquedocs/1.3advconfig.shtml

1.3.2.1 Server Configuration Overview

There are several steps to ensure that the server and the nodes are completely aware of each other and able to communicate directly. Some of this configuration takes place within TORQUE directly using the qmgr command. Other configuration settings are managed using the pbs_servernodes file, DNS files such as /etc/hosts and the /etc/hosts.equiv file.

1.3.2.2 Name Service Configuration

Each node, as well as the server, must be able to resolve the name of every node with which it will interact. This can be accomplished using /etc/hosts, DNS, NIS, or other mechanisms. In the case of /etc/hosts, the file can be shared across systems in most cases.



-Jerry

From: Aaron T Perry <atp42 at cornell.edu<mailto:atp42 at cornell.edu>>
Reply-To: Torque Users Mailing List <torqueusers at supercluster.org<mailto:torqueusers at supercluster.org>>
Date: Tue, 27 Sep 2011 12:33:31 -0400

To: Torque Users Mailing List <torqueusers at supercluster.org<mailto:torqueusers at supercluster.org>>
Subject: Re: [torqueusers] Help: Unauthorized Request

With the execption of the unauthorized request entries it looks like almost everything is okay, execpt for the node file and root <at> localhost (this should be root <at> ubuntu.

Thank you for your help!
Aaron

Here is an except from the server log...

09/27/2011 09:51:31;0002;PBS_Server;Svr;Log;Log opened
09/27/2011 09:51:31;0006;PBS_Server;Svr;PBS_Server;Server ubuntu started, initialization type = 4
09/27/2011 09:51:42;0002;PBS_Server;Svr;Log;Log opened
09/27/2011 09:51:42;0006;PBS_Server;Svr;PBS_Server;Server ubuntu started, initialization type = 4
09/27/2011 09:51:44;0002;PBS_Server;Svr;Act;Account file /var/spool/torque/server_priv/accounting/20110927 opened
09/27/2011 09:51:44;0040;PBS_Server;Req;setup_nodes;setup_nodes()
09/27/2011 09:51:44;0004;PBS_Server;Svr;ubuntu;cannot open node description file '/var/spool/torque/server_priv/nodes' in setup_nodes()
09/27/2011 09:51:44;0002;PBS_Server;Svr;PBS_Server;Expected 0, recovered 0 queues
09/27/2011 09:51:44;0002;PBS_Server;Svr;PBS_Server;Expected 0, recovered 0 jobs
09/27/2011 09:51:44;0006;PBS_Server;Svr;PBS_Server;Using ports Server:15001  Scheduler:15004  MOM:15002 (server: 'ubuntu')
09/27/2011 09:51:44;0002;PBS_Server;Svr;daemonize_server;INFO:      parent is exiting
09/27/2011 09:51:44;0002;PBS_Server;Svr;daemonize_server;INFO:      parent is exiting
09/27/2011 09:51:44;0002;PBS_Server;Svr;daemonize_server;INFO:      child process in background
09/27/2011 09:51:44;0002;PBS_Server;Svr;PBS_Server;Server Ready, pid = 11995, loglevel=0
09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject reply code=15007(Unauthorized Request ), aux=0, type=Manager, from root at localhost
09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject reply code=15007(Unauthorized Request ), aux=0, type=Manager, from root at localhost
09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject reply code=15007(Unauthorized Request ), aux=0, type=Manager, from root at localhost
09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject reply code=15007(Unauthorized Request ), aux=0, type=Manager, from root at localhost
09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject reply code=15007(Unauthorized Request ), aux=0, type=Manager, from root at localhost
09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject reply code=15007(Unauthorized Request ), aux=0, type=Manager, from root at localhost
09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject reply code=15007(Unauthorized Request ), aux=0, type=Manager, from root at localhost
09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject reply code=15007(Unauthorized Request ), aux=0, type=Manager, from root at localhost
09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject reply code=15007(Unauthorized Request ), aux=0, type=Manager, from root at localhost
09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject reply code=15007(Unauthorized Request ), aux=0, type=Manager, from root at localhost
09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject reply code=15007(Unauthorized Request ), aux=0, type=Manager, from root at localhost
09/27/2011 09:51:44;0080;PBS_Server;Req;req_reject;Reject reply code=15007(Unauthorized Request ), aux=0, type=Manager, from root at localhost
09/27/2011 09:51:49;0002;PBS_Server;Svr;PBS_Server;Torque Server Version = 3.0.2, loglevel = 0
09/27/2011 09:56:49;0002;PBS_Server;Svr;PBS_Server;Torque Server Version = 3.0.2, loglevel = 0
...

On Tue, Sep 27, 2011 at 12:13 PM, Smith, Jerry Don II <jdsmit at sandia.gov<mailto:jdsmit at sandia.gov>> wrote:
Are you seeing anything in the pbs_server logs?

-Jerry

From: Aaron T Perry <atp42 at cornell.edu<mailto:atp42 at cornell.edu>>
Reply-To: Torque Users Mailing List <torqueusers at supercluster.org<mailto:torqueusers at supercluster.org>>
Date: Tue, 27 Sep 2011 09:53:28 -0400
To: <torqueusers at supercluster.org<mailto:torqueusers at supercluster.org>>
Subject: Re: [torqueusers] Help: Unauthorized Request

Please, any help you can give would be greatly appreciated, I'm completely stuck. All the solutions I found online have failed.
On Mon, Sep 26, 2011 at 2:35 PM, Aaron <atp42 at cornell.edu<mailto:atp42 at cornell.edu>> wrote:
Hi,

I've just tried to install torque, and I ran the following commands,

./configure
sudo make
sudo make install

however when I run ./torque.setup username I get the following...

initializing TORQUE (admin: username at ubuntu)
PBS_Server ubuntu: Create mode and server database exists,
do you wish to continue y/(n)?y
Max open servers: 9
qmgr obj= svr=default: Unauthorized Request
Max open servers: 9
qmgr obj= svr=default: Unauthorized Request
qmgr obj= svr=default: Unauthorized Request
qmgr obj= svr=default: Unauthorized Request
qmgr obj= svr=default: Unauthorized Request
qmgr obj=batch svr=default: Unauthorized Request
qmgr obj=batch svr=default: Unauthorized Request
qmgr obj=batch svr=default: Unauthorized Request
qmgr obj=batch svr=default: Unauthorized Request
qmgr obj=batch svr=default: Unauthorized Request
qmgr obj=batch svr=default: Unauthorized Request
qmgr obj= svr=default: Unauthorized Request

The server lanched and I cannot stop it, nor can issue any command related to torque (qterm, gmgr, qsub, etc) under my current username or under root. Help!

Thank you,
-Aaron


_______________________________________________ torqueusers mailing list torqueusers at supercluster.org<mailto:torqueusers at supercluster.org> http://www.supercluster.org/mailman/listinfo/torqueusers

_______________________________________________
torqueusers mailing list
torqueusers at supercluster.org<mailto:torqueusers at supercluster.org>
http://www.supercluster.org/mailman/listinfo/torqueusers

_______________________________________________ torqueusers mailing list torqueusers at supercluster.org<mailto:torqueusers at supercluster.org> http://www.supercluster.org/mailman/listinfo/torqueusers

_______________________________________________
torqueusers mailing list
torqueusers at supercluster.org<mailto:torqueusers at supercluster.org>
http://www.supercluster.org/mailman/listinfo/torqueusers

_______________________________________________ torqueusers mailing list torqueusers at supercluster.org<mailto:torqueusers at supercluster.org> http://www.supercluster.org/mailman/listinfo/torqueusers

_______________________________________________
torqueusers mailing list
torqueusers at supercluster.org<mailto:torqueusers at supercluster.org>
http://www.supercluster.org/mailman/listinfo/torqueusers


_______________________________________________
torqueusers mailing list
torqueusers at supercluster.org<mailto:torqueusers at supercluster.org>
http://www.supercluster.org/mailman/listinfo/torqueusers

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/torqueusers/attachments/20110927/6de1df57/attachment-0001.html 


More information about the torqueusers mailing list