[torqueusers] Can't submit job from remote submit host

Alexandr Baskakov avb at ssau.ru
Fri Mar 30 23:18:33 MDT 2012


Hi, All.

I'm trying to submit a job from a submit host to a remote TORQUE server.

I have 2 nodes:
mgt1 - torque client
mgt2 - torque server and moab.

Domain: ssc

On mgt2:
[mgt2 ~]$ qmgr -c 'l s'
Server mgt2
         server_state = Active
         scheduling = True
         total_jobs = 0
         state_count = Transit:0 Queued:0 Held:0 Waiting:0 Running:0 Exiting:0
         acl_hosts = localhost,mgt2,mgt1
         managers = root at mgt2,torque at mgt2
         operators = root at mgt2,torque at mgt2
         default_queue = batch
         log_events = 511
         mail_from = adm
         query_other_jobs = True
         resources_assigned.ncpus = 0
         resources_assigned.nodect = 0
         scheduler_iteration = 600
         node_check_rate = 150
         tcp_timeout = 6
         log_level = 7
         mom_job_sync = True
         pbs_version = 3.0.2
         keep_completed = 300
         submit_hosts = mgt1.ssc
         next_job_number = 57
         net_counter = 2 0 0

When I try to submit a job from mgt1 with:

[mgt1 ~]$ PBS_DEFAULT=mgt2 qsub
hostname
qsub: Bad UID for job execution MSG=ruserok failed validating avb/avb from mgt1

I get the error shown above.

On mgt2, in logfile:
03/26/2012 15:52:57;0080;PBS_Server;Req;dis_request_read;decoding command AuthenticateUser from avb
03/26/2012 15:52:57;0100;PBS_Server;Req;;Type AuthenticateUser request received from avb at mgt1.ssc, sock=14
03/26/2012 15:52:57;0008;PBS_Server;Job;dispatch_request;dispatching request AuthenticateUser on sd=14
03/26/2012 15:52:57;0008;PBS_Server;Job;reply_send;Reply sent for request type AuthenticateUser on socket 14
03/26/2012 15:52:57;0080;PBS_Server;Req;dis_request_read;decoding command Disconnect from PBS_Server
03/26/2012 15:52:57;0080;PBS_Server;Req;dis_request_read;decoding command QueueJob from avb
03/26/2012 15:52:57;0100;PBS_Server;Req;;Type QueueJob request received from avb at mgt1.ssc, sock=13
03/26/2012 15:52:57;0008;PBS_Server;Job;dispatch_request;dispatching request QueueJob on sd=13
03/26/2012 15:52:57;0080;PBS_Server;Job;62.mgt2;removed job file
03/26/2012 15:52:57;0080;PBS_Server;Req;req_reject;Reject reply code=15025(Bad UID for job execution MSG=ruserok failed validating avb/avb from mgt1), aux=0, type=QueueJob, from avb at mgt1.ssc
03/26/2012 15:52:57;0008;PBS_Server;Job;reply_send;Reply sent for request type QueueJob on socket 13
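
To pull only the rejected requests out of a longer server log, a small filter along these lines may help. It is a minimal sketch that assumes every record has the ';'-separated layout seen in the excerpt above (timestamp, event code, server name, object type, object id or function, message) and that the message itself contains no ';'. The function name show_rejects is made up for illustration.

```shell
# Print the timestamp and message of every rejected request.
# Reads pbs_server log lines on stdin; assumes ';'-separated fields
# as in the excerpt above, with "req_reject" in the fifth field.
show_rejects() {
    awk -F';' '$5 == "req_reject" { print $1 " " $6 }'
}
```

It would be fed the server log on stdin, for example show_rejects < LOGFILE, where LOGFILE stands for wherever the installation keeps its pbs_server log.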

Authentication on mgt1 and mgt2 is done through nss_ldap. Logging in to mgt2 as user avb works fine.
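
Since both hosts take their accounts from LDAP, one thing that can be compared is whether avb resolves to the same UID and GID on each of them. The following is only a sketch of such a comparison, not a diagnosis of the ruserok failure. The helper names uid_gid and same_identity are hypothetical, and the input is assumed to be a standard passwd-format line such as the one printed by getent passwd.

```shell
# Extract "uid:gid" from one passwd-format line
# (name:passwd:uid:gid:gecos:home:shell).
uid_gid() {
    printf '%s\n' "$1" | cut -d: -f3,4
}

# Succeed only if two passwd-format lines carry the same uid and gid.
same_identity() {
    [ "$(uid_gid "$1")" = "$(uid_gid "$2")" ]
}
```

Assuming ssh access from mgt1 to mgt2, it could be called as same_identity "$(getent passwd avb)" "$(ssh mgt2 getent passwd avb)" && echo match.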

Can anyone help, please?

-- 
Alexandr Baskakov, Samara State Aerospace University
e-mail:	avb at ssau.ru


