[torqueusers] torque 2.3.4 problem with mom_priv/confg
and server_name
Bas van der Vlies
basv at sara.nl
Mon Nov 17 01:18:10 MST 2008
>
> ==== 2.1.11 version ===
> 11/14/2008 09:05:42;0002; pbs_mom;n/a;initialize;independent
> 11/14/2008 09:05:42;0002; pbs_mom;Svr;pbs_mom;Is up
> 11/14/2008 09:05:42;0002; pbs_mom;Svr;mom_main;MOM executable path
> and mtime at launch: /usr/sbin/pbs_mom 1226649791
> 11/14/2008 09:05:42;0002; pbs_mom;n/a;mom_main;hello sent to server
> ib-batch-ng.irc.sara.nl
>
>
> === 2.3.4 version === (Note is sends two hello's)
>
> 11/14/2008 09:04:24;0002; pbs_mom;n/
> a;mom_server_check_connection;sending hello to server ib-batch-
> ng.irc.sara.nl
> 11/14/2008 09:04:24;0002; pbs_mom;n/
> a;mom_server_check_connection;sending hello to server batch-
> ng.irc.sara.nl
> 11/14/2008 09:04:24;0002; pbs_mom;Svr;im_eof;End of File from addr
> 192.168.144.4:15001
> 11/14/2008 09:04:24;0002; pbs_mom;Svr;im_eof;End of File from addr
> 192.168.144.4:15001
>
I found the problem and do not know if this fix is correct. The
mom_server_count is introduced but never set when a pbs server is found. So
pbs_mom wil always use the name set PBS_HOME/server_name and always ignore
the $pbsserver directive in mom_priv/config. Maybe we must set
mom_server_count in mom_server_add function
--- mom_main.c.org 2008-11-17 09:03:29.579546323 +0100
+++ mom_main.c 2008-11-17 09:04:41.527388470 +0100
@@ -1662,7 +1662,8 @@
log_record(PBSEVENT_SYSTEM, PBS_EVENTCLASS_SERVER, id, value);
-
+ /* HvB */
+ mom_server_count = 1;
return(mom_server_add(value));
} /* END setpbsserver() */
--
********************************************************************
* Bas van der Vlies e-mail: basv at sara.nl *
* SARA - Academic Computing Services Amsterdam, The Netherlands *
********************************************************************
More information about the torqueusers
mailing list