[torqueusers] qsub: Bad UID for job execution

Greenseid, Joseph M (IS) Joseph.Greenseid at ngc.com
Mon Mar 29 12:00:17 MDT 2010


> At first I had /var/spool/pbs/server_name set to cluster.fing.edu.uy.
> Now I changed it to pbs_oscar but had no luck, I keep getting the "Bad
> UID for job execution" error.

cluster.fing.edu.uy and node01 (a.k.a. pbs_oscar) are just different interfaces on the same node, correct?
 
if so, when you run the command `/bin/hostname`, what do you get back?
 
my experience with oscar is that `/bin/hostname` needs to return the name that is associated with the private IP addr (the same interface as pbs_oscar, oscar_server, nfs_oscar); if your hostname is returning as cluster.fing.edu.uy, that could be the problem you're running into.
 
--Joe

________________________________

From: torqueusers-bounces at supercluster.org on behalf of Santiago Iturriaga
Sent: Sat 3/27/2010 10:16 PM
To: torqueusers at supercluster.org
Subject: Re: [torqueusers] qsub: Bad UID for job execution



/etc/hosts contains the following:

[siturria at cluster ~]$ cat /etc/hosts
# Do not remove the following line, or various programs
# that require network functionality will fail.
::1     localhost.localdomain   localhost
192.168.242.20  node20.cluster.fing     node20
192.168.242.19  node19.cluster.fing     node19
192.168.242.18  node18.cluster.fing     node18
192.168.242.17  node17.cluster.fing     node17
192.168.242.16  node16.cluster.fing     node16
192.168.242.15  node15.cluster.fing     node15
192.168.242.14  node14.cluster.fing     node14
192.168.242.13  node13.cluster.fing     node13
192.168.242.12  node12.cluster.fing     node12
192.168.242.11  node11.cluster.fing     node11
192.168.242.10  node10.cluster.fing     node10
192.168.242.9   node09.cluster.fing     node09
192.168.242.8   node08.cluster.fing     node08
192.168.242.7   node07.cluster.fing     node07
192.168.242.6   node06.cluster.fing     node06
192.168.242.5   node05.cluster.fing     node05
192.168.242.4   node04.cluster.fing     node04
192.168.242.3   node03.cluster.fing     node03
192.168.242.2   node02.cluster.fing     node02
192.168.242.1   node01.cluster.fing     node01  oscar_server   
nfs_oscar       pbs_oscar
164.73.47.186   cluster.fing.edu.uy     cluster

At first I had /var/spool/pbs/server_name set to cluster.fing.edu.uy.
Now I changed it to pbs_oscar but had no luck, I keep getting the "Bad
UID for job execution" error.


El 26/03/2010 12:17 p.m., Arnau Bria escribió:
> On Fri, 26 Mar 2010 11:31:34 +0000
> Santiago Iturriaga wrote:
>
> Hi Santiago,
>
> what's the content of /etc/hosts?
> Seems to me that torque is considering your server names as diff hosts,
> and confused about primary name server.
>
> *as simple test, try adding them to /etc/hosts.equiv. Not sure how R*
> commands will behave between diff hostnames in same host.
>
> HTH,
> Arnau
> _______________________________________________
> torqueusers mailing list
> torqueusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/torqueusers
>   

_______________________________________________
torqueusers mailing list
torqueusers at supercluster.org
http://www.supercluster.org/mailman/listinfo/torqueusers


-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/torqueusers/attachments/20100329/abc1fa24/attachment.html 


More information about the torqueusers mailing list