[torqueusers] Can't submit job from remote submit host
Alexandr Baskakov
avb at ssau.ru
Fri Mar 30 23:18:33 MDT 2012
Hi, All.
I'am trying to submit job from submit host to remote server with torque.
Have 2 nodes:
mgt1 - torque client
mgt2 - torque server and moab.
Domain: ssc
On mgt2:
[mgt2 ~]$ qmgr -c 'l s'
Server mgt2
server_state = Active
scheduling = True
total_jobs = 0
state_count = Transit:0 Queued:0 Held:0 Waiting:0 Running:0 Exiting:0
acl_hosts = localhost,mgt2,mgt1
managers = root at mgt2,torque at mgt2
operators = root at mgt2,torque at mgt2
default_queue = batch
log_events = 511
mail_from = adm
query_other_jobs = True
resources_assigned.ncpus = 0
resources_assigned.nodect = 0
scheduler_iteration = 600
node_check_rate = 150
tcp_timeout = 6
log_level = 7
mom_job_sync = True
pbs_version = 3.0.2
keep_completed = 300
submit_hosts = mgt1.ssc
next_job_number = 57
net_counter = 2 0 0
When I trying to submit job from mgt1 by:
[mgt1 ~]$ PBS_DEFAULT=mgt2 qsub
hostname
qsub: Bad UID for job execution MSG=ruserok failed validating avb/avb from mgt1
have an error.
On mgt2, in logfile:
03/26/2012 15:52:57;0080;PBS_Server;Req;dis_request_read;decoding command AuthenticateUser from avb
03/26/2012 15:52:57;0100;PBS_Server;Req;;Type AuthenticateUser request received from avb at mgt1.ssc, sock=14
03/26/2012 15:52:57;0008;PBS_Server;Job;dispatch_request;dispatching request AuthenticateUser on sd=14
03/26/2012 15:52:57;0008;PBS_Server;Job;reply_send;Reply sent for request type AuthenticateUser on socket 14
03/26/2012 15:52:57;0080;PBS_Server;Req;dis_request_read;decoding command Disconnect from PBS_Server
03/26/2012 15:52:57;0080;PBS_Server;Req;dis_request_read;decoding command QueueJob from avb
03/26/2012 15:52:57;0100;PBS_Server;Req;;Type QueueJob request received from avb at mgt1.ssc, sock=13
03/26/2012 15:52:57;0008;PBS_Server;Job;dispatch_request;dispatching request QueueJob on sd=13
03/26/2012 15:52:57;0080;PBS_Server;Job;62.mgt2;removed job file
03/26/2012 15:52:57;0080;PBS_Server;Req;req_reject;Reject reply code=15025(Bad UID for job execution MSG=ruserok failed validating avb/avb from mgt1), aux=0, type=QueueJob, from avb at mgt1.ssc
03/26/2012 15:52:57;0008;PBS_Server;Job;reply_send;Reply sent for request type QueueJob on socket 13
Authentication on mgt1,mgt2 making by nss_ldap. Login to mgt2 by user avb works ok.
Can anyone halp, please...
--
Alexandr Baskakov, Samara State Aerospace University
e-mail: avb at ssau.ru
More information about the torqueusers
mailing list