[torqueusers] Can't submit job from remote submit host
David Beer
dbeer at adaptivecomputing.com
Tue Apr 3 09:12:16 MDT 2012
What does your /etc/hosts.equiv file look like? You may need an entry that
is something like:
<remote_hostname> +
or
<remote_hostname> <list of users that can run client commands there>
The + just allows all users. Note that this check is in addition to your
TORQUE settings so having a plus there doesn't make everyone a manager or
anything crazy like that.
David
On Fri, Mar 30, 2012 at 11:18 PM, Alexandr Baskakov <avb at ssau.ru> wrote:
> Hi, All.
>
> I'am trying to submit job from submit host to remote server with torque.
>
> Have 2 nodes:
> mgt1 - torque client
> mgt2 - torque server and moab.
>
> Domain: ssc
>
> On mgt2:
> [mgt2 ~]$ qmgr -c 'l s'
> Server mgt2
> server_state = Active
> scheduling = True
> total_jobs = 0
> state_count = Transit:0 Queued:0 Held:0 Waiting:0 Running:0
> Exiting:0
> acl_hosts = localhost,mgt2,mgt1
> managers = root at mgt2,torque at mgt2
> operators = root at mgt2,torque at mgt2
> default_queue = batch
> log_events = 511
> mail_from = adm
> query_other_jobs = True
> resources_assigned.ncpus = 0
> resources_assigned.nodect = 0
> scheduler_iteration = 600
> node_check_rate = 150
> tcp_timeout = 6
> log_level = 7
> mom_job_sync = True
> pbs_version = 3.0.2
> keep_completed = 300
> submit_hosts = mgt1.ssc
> next_job_number = 57
> net_counter = 2 0 0
>
> When I trying to submit job from mgt1 by:
>
> [mgt1 ~]$ PBS_DEFAULT=mgt2 qsub
> hostname
> qsub: Bad UID for job execution MSG=ruserok failed validating avb/avb from
> mgt1
>
> have an error.
>
> On mgt2, in logfile:
> 03/26/2012 15:52:57;0080;PBS_Server;Req;dis_request_read;decoding command
> AuthenticateUser from avb
> 03/26/2012 15:52:57;0100;PBS_Server;Req;;Type AuthenticateUser request
> received from avb at mgt1.ssc, sock=14
> 03/26/2012 15:52:57;0008;PBS_Server;Job;dispatch_request;dispatching
> request AuthenticateUser on sd=14
> 03/26/2012 15:52:57;0008;PBS_Server;Job;reply_send;Reply sent for request
> type AuthenticateUser on socket 14
> 03/26/2012 15:52:57;0080;PBS_Server;Req;dis_request_read;decoding command
> Disconnect from PBS_Server
> 03/26/2012 15:52:57;0080;PBS_Server;Req;dis_request_read;decoding command
> QueueJob from avb
> 03/26/2012 15:52:57;0100;PBS_Server;Req;;Type QueueJob request received
> from avb at mgt1.ssc, sock=13
> 03/26/2012 15:52:57;0008;PBS_Server;Job;dispatch_request;dispatching
> request QueueJob on sd=13
> 03/26/2012 15:52:57;0080;PBS_Server;Job;62.mgt2;removed job file
> 03/26/2012 15:52:57;0080;PBS_Server;Req;req_reject;Reject reply
> code=15025(Bad UID for job execution MSG=ruserok failed validating avb/avb
> from mgt1), aux=0, type=QueueJob, from avb at mgt1.ssc
> 03/26/2012 15:52:57;0008;PBS_Server;Job;reply_send;Reply sent for request
> type QueueJob on socket 13
>
> Authentication on mgt1,mgt2 making by nss_ldap. Login to mgt2 by user avb
> works ok.
>
> Can anyone halp, please...
>
> --
> Alexandr Baskakov, Samara State Aerospace University
> e-mail: avb at ssau.ru
>
> _______________________________________________
> torqueusers mailing list
> torqueusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/torqueusers
>
--
David Beer | Software Engineer
Adaptive Computing
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/torqueusers/attachments/20120403/1247030a/attachment.html
More information about the torqueusers
mailing list