[torqueusers] Re: FW: HPUX 11 failure torque 2.0.0p1, 2, 3 and 1.2.0p6

Garrick Staples garrick at usc.edu
Thu Dec 15 13:23:46 MST 2005


On Thu, Dec 15, 2005 at 01:29:29PM -0600, Mike Coyne alleged:
> There are some issues regarding HPUX and torque in versions after
> 1.2.0p5 surrounding pbs_iff on the client and server side.  On the
> client side , src/lib/Netlib/net_client.c
> 
>  
> 
> Below is a diff between 2.0.0.p3 and 1.2.0.p5 , in order to get pbs_iff
> to connect from a remote host( one of the mom clients) I had to backport
> the older version of this file ..

The bits with tv_sec and select() don't look important to me.

The important part might be the size of 'one'.  I'm thinking it should
be an int, not a long.  Can you try just that one change in p3?

@@ -177,7 +172,7 @@ int client_to_svr(
   int                sock;
   unsigned short     tryport;
   int                flags;
-  int                one = 1;
+  long               one = 1;
   
   local.sin_family = AF_INET;
   local.sin_addr.s_addr = 0;


The arguments changes to setsockopt() appears correct to me, especially
the last argument.

 
> In order to get src/resmom/hpux11 (or hpux10) / mom_mach.c  to compile I
> added 
> 
>  
> 
>  extern  int     ignwalltime;

Ouch.  Fixed in CVS.


> The remaining problem is  as follows,  
> 
> Pbs_iff  dis connects with invalid credential ==>PBSE_BADCRED in
> src/server/process_request.c from 

This would imply the bind() to a priviledged port isn't working.

Do you have bindresvport() on HPUX?


 
> The output from gdb's server_conn has a suspious cn_addr  the connection
> was from a qstat on the same host as the server ?  although this may be
> fallout from a previous authentication error ?

> (gdb) print svr_conn[sfds]
> 
> $1 = {cn_addr = 2147483649, cn_handle = -1, cn_port = 40696, cn_authen =
> 0, 

cn_port should probably be less than 1024 at that point.

-- 
Garrick Staples, Linux/HPCC Administrator
University of Southern California
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
Url : http://www.supercluster.org/pipermail/torqueusers/attachments/20051215/35ef81ae/attachment.bin


More information about the torqueusers mailing list