[torqueusers] Can't submit job from remote submit host

Clotho Tsang wytsang at clustertech.com
Mon Apr 22 00:46:48 MDT 2013


I have found the same problem, it can be solved by
using "submit_hosts = mgt1", remove the domainname.

http://osdir.com/ml/clustering.torque.user/2008-03/msg00014.html


On 4 April 2012 01:20, Alexandr Baskakov <avb at ssau.ru> wrote:

>  I have submit_hosts in my server settings.
> ...
> submit_hosts = mgt1.ssc
> ...
> Anyway, creating /etc/hosts.equiv with mgt1.ssc+ leads to the same result.
> qsub: Bad UID for job execution MSG=ruserok failed validating avb/avb from
> mgt1
>
> 03.04.2012 19:12, David Beer пишет:
>
> What does your /etc/hosts.equiv file look like? You may need an entry that
> is something like:
>
>  <remote_hostname> +
> or
> <remote_hostname> <list of users that can run client commands there>
>
>  The + just allows all users. Note that this check is in addition to your
> TORQUE settings so having a plus there doesn't make everyone a manager or
> anything crazy like that.
>
>  David
>
>  On Fri, Mar 30, 2012 at 11:18 PM, Alexandr Baskakov <avb at ssau.ru> wrote:
>
>> Hi, All.
>>
>> I'am trying to submit job from submit host to remote server with torque.
>>
>> Have 2 nodes:
>> mgt1 - torque client
>> mgt2 - torque server and moab.
>>
>> Domain: ssc
>>
>> On mgt2:
>> [mgt2 ~]$ qmgr -c 'l s'
>> Server mgt2
>>         server_state = Active
>>         scheduling = True
>>         total_jobs = 0
>>         state_count = Transit:0 Queued:0 Held:0 Waiting:0 Running:0
>> Exiting:0
>>         acl_hosts = localhost,mgt2,mgt1
>>         managers = root at mgt2,torque at mgt2
>>         operators = root at mgt2,torque at mgt2
>>         default_queue = batch
>>         log_events = 511
>>         mail_from = adm
>>         query_other_jobs = True
>>         resources_assigned.ncpus = 0
>>         resources_assigned.nodect = 0
>>         scheduler_iteration = 600
>>         node_check_rate = 150
>>         tcp_timeout = 6
>>         log_level = 7
>>         mom_job_sync = True
>>         pbs_version = 3.0.2
>>         keep_completed = 300
>>         submit_hosts = mgt1.ssc
>>         next_job_number = 57
>>         net_counter = 2 0 0
>>
>> When I trying to submit job from mgt1 by:
>>
>> [mgt1 ~]$ PBS_DEFAULT=mgt2 qsub
>> hostname
>> qsub: Bad UID for job execution MSG=ruserok failed validating avb/avb
>> from mgt1
>>
>> have an error.
>>
>> On mgt2, in logfile:
>> 03/26/2012 15:52:57;0080;PBS_Server;Req;dis_request_read;decoding command
>> AuthenticateUser from avb
>> 03/26/2012 15:52:57;0100;PBS_Server;Req;;Type AuthenticateUser request
>> received from avb at mgt1.ssc, sock=14
>> 03/26/2012 15:52:57;0008;PBS_Server;Job;dispatch_request;dispatching
>> request AuthenticateUser on sd=14
>> 03/26/2012 15:52:57;0008;PBS_Server;Job;reply_send;Reply sent for request
>> type AuthenticateUser on socket 14
>> 03/26/2012 15:52:57;0080;PBS_Server;Req;dis_request_read;decoding command
>> Disconnect from PBS_Server
>> 03/26/2012 15:52:57;0080;PBS_Server;Req;dis_request_read;decoding command
>> QueueJob from avb
>> 03/26/2012 15:52:57;0100;PBS_Server;Req;;Type QueueJob request received
>> from avb at mgt1.ssc, sock=13
>> 03/26/2012 15:52:57;0008;PBS_Server;Job;dispatch_request;dispatching
>> request QueueJob on sd=13
>> 03/26/2012 15:52:57;0080;PBS_Server;Job;62.mgt2;removed job file
>> 03/26/2012 15:52:57;0080;PBS_Server;Req;req_reject;Reject reply
>> code=15025(Bad UID for job execution MSG=ruserok failed validating avb/avb
>> from mgt1), aux=0, type=QueueJob, from avb at mgt1.ssc
>> 03/26/2012 15:52:57;0008;PBS_Server;Job;reply_send;Reply sent for request
>> type QueueJob on socket 13
>>
>> Authentication on mgt1,mgt2 making by nss_ldap. Login to mgt2 by user avb
>> works ok.
>>
>> Can anyone halp, please...
>>
>> --
>> Alexandr Baskakov, Samara State Aerospace University
>> e-mail: avb at ssau.ru
>>
>> _______________________________________________
>> torqueusers mailing list
>> torqueusers at supercluster.org
>> http://www.supercluster.org/mailman/listinfo/torqueusers
>>
>
>
>
>  --
>  David Beer | Software Engineer
> Adaptive Computing
>
>
>
>  _______________________________________________
> torqueusers mailing listtorqueusers at supercluster.orghttp://www.supercluster.org/mailman/listinfo/torqueusers
>
>
>  --
> Alexandr Baskakov, Samara State Aerospace University
> e-mail:	avb at ssau.ru
>
>
> _______________________________________________
> torqueusers mailing list
> torqueusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/torqueusers
>
>


-- 
Clotho Tsang
Senior Software Engineer
Cluster Technology Limited
Email: clotho at clustertech.com
Tel: (852) 2655-6129
Fax: (852) 2994-2101
Website: www.clustertech.com
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/torqueusers/attachments/20130422/4ced903e/attachment.html 


More information about the torqueusers mailing list