[torqueusers] torque requires --enable-debug to work
garrick at usc.edu
Fri Oct 14 20:10:20 MDT 2005
On Fri, Oct 14, 2005 at 07:01:20PM -0700, Garrick Staples alleged:
> On Wed, Oct 12, 2005 at 01:50:35PM +1000, Andrew Sharpe alleged:
> > Hi all,
> > I'm fairly new to torque but I think this is non-standard behaviour.
> > I've verified that this problem exists on
> > torque-1.2.0p4
> > torque-1.2.0p6
> > torque-1.2.0p7-snap.1127772314
> > using CentOS4.1 (full install) on x86_64.
> > The problem is that torque only works if I compile it with
> > --enable-debug. Here's the steps I follow to obtain the results:
> > NOTE: pbs is a CNAME to machine1, which correctly resolves to 10.1.1.12,
> > machine2 correctly resolves to 10.1.1.13 - by resolves I mean forward
> > and reverse lookups are ok.
> > 1. compile any of the above versions using the following commands
> > ./configure --prefix=/usr/local --set-default-server=pbs
> > --set-server-home=/var/spool/PBS --enable-server --enable-docs
> > --enable-mom --enable-clients --enable-syslog --disable-rpp
> > make
> > make install
> Thanks for the very good debugging info! Did you happen to check for
> anything in /var/log/messages? Since you configured with
> --enable-syslog, there might be some valuable info.
> What is the server tcp_timeout ('p s tcp_timeout' in qmgr)? That should
> be at least 6 or so.
> Please do 'set server log_level = 7' in qmgr and run 'momctl -q
> loglevel=7 -h machine1,machine2' and retry the tests. That will add
> additional info to the server and mom logs.
> Also try again without '--disable-rpp' (be sure to reinstall and
> restart server and moms).
Also check the mom jobstartblocktime (momctl -q jobstartblocktime -h
machine1,machine2). I suspect it would be really bad if
jobstartblocktime was larger than tcp_timeout.
Garrick Staples, Linux/HPCC Administrator
University of Southern California
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Size: 189 bytes
Desc: not available
Url : http://www.supercluster.org/pipermail/torqueusers/attachments/20051014/f77d0c18/attachment.bin
More information about the torqueusers