[torqueusers] error:Bad Uid for job
lihui
lihui at net.pku.edu.cn
Wed Mar 21 19:40:48 MDT 2007
Hi,all
I have just set up the torque-2.1.8 environment. And the specify the attribute of
server,queue,nodes according to torque documents. I started pbs_server,pbs_sched,
pbs_mom, then input the commands "qsub myscrip.sh" it just throw "bad uid for job
execution" error.
I have looked up the server_log ,and use qmgr>"print server" to got server
specification. Both information was put into the attachment,please have a look.
Any hints and clue will be highly appreciated,thanks.
LiHui
--
Open WebMail Project (http://openwebmail.org)
-------------- next part --------------
server_log
03/22/2007 07:08:32;0006;PBS_Server;Svr;PBS_Server;Using ports Server:15001 Scheduler:15004 MOM:15002
03/22/2007 07:08:32;0002;PBS_Server;Svr;PBS_Server;Server Ready, pid = 2569, loglevel=0
03/22/2007 07:08:32;0004;PBS_Server;Svr;WARNING;ALERT: unable to contact node localhost
03/22/2007 07:08:32;0001;PBS_Server;Svr;PBS_Server;Connection refused (111) in contact_sched, Could not contact Scheduler - port 15004
03/22/2007 07:08:35;0100;PBS_Server;Req;;Type AuthenticateUser request received from root at localhost.localdomain, sock=11
03/22/2007 07:08:37;0040;PBS_Server;Req;ping_nodes;ping attempting to contact 1 nodes
03/22/2007 07:08:37;0040;PBS_Server;Req;ping_nodes;successful ping to node localhost (stream 0)
03/22/2007 07:08:39;0100;PBS_Server;Req;;Type StatusQueue request received from root at localhost.localdomain, sock=10
03/22/2007 07:08:39;0100;PBS_Server;Req;;Type StatusServer request received from root at localhost.localdomain, sock=10
03/22/2007 07:09:17;0001;PBS_Server;Svr;PBS_Server;stream_eof, connection to localhost is bad, remote service may be down, message may be corrupt, or connection may have been dropped remotely (Premature end of message). setting node state to down
03/22/2007 07:13:43;0100;PBS_Server;Req;;Type AuthenticateUser request received from root at localhost.localdomain, sock=11
03/22/2007 07:13:43;0100;PBS_Server;Req;;Type QueueJob request received from root at localhost.localdomain, sock=10
03/22/2007 07:13:43;0080;PBS_Server;Req;req_reject;Reject reply code=15023(Bad UID for job execution), aux=0, type=QueueJob, from root at localhost.localdomain
print server
#
# Create queues and set their attributes.
#
#
# Create and define queue batch
#
create queue batch
set queue batch queue_type = Execution
set queue batch resources_default.nodes = 1
set queue batch resources_default.walltime = 01:00:00
set queue batch enabled = True
set queue batch started = True
#
# Set server attributes.
#
set server scheduling = True
set server managers = root at localhost.localdomain
set server operators = root at localhost.localdomain
set server default_queue = batch
set server log_events = 511
set server mail_from = adm
set server scheduler_iteration = 600
set server node_check_rate = 150
set server tcp_timeout = 6
set server pbs_version = 2.1.8
set server submit_hosts = localhost
set server allow_node_submit = True
More information about the torqueusers
mailing list