[torqueusers] Can't submit job from remote submit host

David Beer dbeer at adaptivecomputing.com
Tue Apr 3 09:12:16 MDT 2012


What does your /etc/hosts.equiv file look like? You may need an entry that
is something like:

<remote_hostname> +
or
<remote_hostname> <list of users that can run client commands there>

The + just allows all users. Note that this check is in addition to your
TORQUE settings so having a plus there doesn't make everyone a manager or
anything crazy like that.

David

On Fri, Mar 30, 2012 at 11:18 PM, Alexandr Baskakov <avb at ssau.ru> wrote:

> Hi, All.
>
> I'am trying to submit job from submit host to remote server with torque.
>
> Have 2 nodes:
> mgt1 - torque client
> mgt2 - torque server and moab.
>
> Domain: ssc
>
> On mgt2:
> [mgt2 ~]$ qmgr -c 'l s'
> Server mgt2
>         server_state = Active
>         scheduling = True
>         total_jobs = 0
>         state_count = Transit:0 Queued:0 Held:0 Waiting:0 Running:0
> Exiting:0
>         acl_hosts = localhost,mgt2,mgt1
>         managers = root at mgt2,torque at mgt2
>         operators = root at mgt2,torque at mgt2
>         default_queue = batch
>         log_events = 511
>         mail_from = adm
>         query_other_jobs = True
>         resources_assigned.ncpus = 0
>         resources_assigned.nodect = 0
>         scheduler_iteration = 600
>         node_check_rate = 150
>         tcp_timeout = 6
>         log_level = 7
>         mom_job_sync = True
>         pbs_version = 3.0.2
>         keep_completed = 300
>         submit_hosts = mgt1.ssc
>         next_job_number = 57
>         net_counter = 2 0 0
>
> When I trying to submit job from mgt1 by:
>
> [mgt1 ~]$ PBS_DEFAULT=mgt2 qsub
> hostname
> qsub: Bad UID for job execution MSG=ruserok failed validating avb/avb from
> mgt1
>
> have an error.
>
> On mgt2, in logfile:
> 03/26/2012 15:52:57;0080;PBS_Server;Req;dis_request_read;decoding command
> AuthenticateUser from avb
> 03/26/2012 15:52:57;0100;PBS_Server;Req;;Type AuthenticateUser request
> received from avb at mgt1.ssc, sock=14
> 03/26/2012 15:52:57;0008;PBS_Server;Job;dispatch_request;dispatching
> request AuthenticateUser on sd=14
> 03/26/2012 15:52:57;0008;PBS_Server;Job;reply_send;Reply sent for request
> type AuthenticateUser on socket 14
> 03/26/2012 15:52:57;0080;PBS_Server;Req;dis_request_read;decoding command
> Disconnect from PBS_Server
> 03/26/2012 15:52:57;0080;PBS_Server;Req;dis_request_read;decoding command
> QueueJob from avb
> 03/26/2012 15:52:57;0100;PBS_Server;Req;;Type QueueJob request received
> from avb at mgt1.ssc, sock=13
> 03/26/2012 15:52:57;0008;PBS_Server;Job;dispatch_request;dispatching
> request QueueJob on sd=13
> 03/26/2012 15:52:57;0080;PBS_Server;Job;62.mgt2;removed job file
> 03/26/2012 15:52:57;0080;PBS_Server;Req;req_reject;Reject reply
> code=15025(Bad UID for job execution MSG=ruserok failed validating avb/avb
> from mgt1), aux=0, type=QueueJob, from avb at mgt1.ssc
> 03/26/2012 15:52:57;0008;PBS_Server;Job;reply_send;Reply sent for request
> type QueueJob on socket 13
>
> Authentication on mgt1,mgt2 making by nss_ldap. Login to mgt2 by user avb
> works ok.
>
> Can anyone halp, please...
>
> --
> Alexandr Baskakov, Samara State Aerospace University
> e-mail: avb at ssau.ru
>
> _______________________________________________
> torqueusers mailing list
> torqueusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/torqueusers
>



-- 
David Beer | Software Engineer
Adaptive Computing
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/torqueusers/attachments/20120403/1247030a/attachment.html 


More information about the torqueusers mailing list