[torqueusers] torque 2.3.4 problem with mom_priv/confg and server_name

Bas van der Vlies basv at sara.nl
Mon Nov 17 01:18:10 MST 2008


> 
> ==== 2.1.11 version ===
> 11/14/2008 09:05:42;0002;   pbs_mom;n/a;initialize;independent
> 11/14/2008 09:05:42;0002;   pbs_mom;Svr;pbs_mom;Is up
> 11/14/2008 09:05:42;0002;   pbs_mom;Svr;mom_main;MOM executable path
> and mtime at launch: /usr/sbin/pbs_mom 1226649791
> 11/14/2008 09:05:42;0002;   pbs_mom;n/a;mom_main;hello sent to server
> ib-batch-ng.irc.sara.nl
> 
> 
> === 2.3.4  version === (Note is sends two hello's)
> 
> 11/14/2008 09:04:24;0002;   pbs_mom;n/
> a;mom_server_check_connection;sending hello to server ib-batch-
> ng.irc.sara.nl
> 11/14/2008 09:04:24;0002;   pbs_mom;n/
> a;mom_server_check_connection;sending hello to server batch-
> ng.irc.sara.nl
> 11/14/2008 09:04:24;0002;   pbs_mom;Svr;im_eof;End of File from addr
> 192.168.144.4:15001
> 11/14/2008 09:04:24;0002;   pbs_mom;Svr;im_eof;End of File from addr
> 192.168.144.4:15001
> 

I found the problem and do not know if this fix is correct. The
mom_server_count is introduced but never set when a pbs server is found. So
pbs_mom wil always use the name set PBS_HOME/server_name and always ignore
the $pbsserver directive in mom_priv/config. Maybe we must set
mom_server_count in mom_server_add function

--- mom_main.c.org	2008-11-17 09:03:29.579546323 +0100
+++ mom_main.c	2008-11-17 09:04:41.527388470 +0100
@@ -1662,7 +1662,8 @@

   log_record(PBSEVENT_SYSTEM, PBS_EVENTCLASS_SERVER, id, value);

-
+  /* HvB */
+  mom_server_count =  1;
   return(mom_server_add(value));
   }  /* END setpbsserver() */


-- 
********************************************************************
*  Bas van der Vlies                    e-mail: basv at sara.nl       *
*  SARA - Academic Computing Services   Amsterdam, The Netherlands *
********************************************************************


More information about the torqueusers mailing list