[torquedev] [Bug 161] New: pbs_server, init.d hangs when creating serverdb.
bugzilla-daemon at supercluster.org
bugzilla-daemon at supercluster.org
Sat Oct 8 16:06:22 MDT 2011
http://www.clusterresources.com/bugzilla/show_bug.cgi?id=161
Summary: pbs_server, init.d hangs when creating serverdb.
Product: TORQUE
Version: 3.0.x
Platform: PC
OS/Version: Linux
Status: NEW
Severity: normal
Priority: P5
Component: pbs_server
AssignedTo: dbeer at adaptivecomputing.com
ReportedBy: steve.traylen at cern.ch
CC: torquedev at supercluster.org
Estimated Hours: 0.0
Hi,
Reported
http://www.supercluster.org/pipermail/torqueusers/2011-May/012828.html
and
https://bugzilla.redhat.com/show_bug.cgi?id=744138
steps to reproduce
1. /etc/init.d/pbs_server stop
2. rm -f /var/torque/server_priv/serverdb
3. /etc/init.d/pbs_server start
The init.d script is left in a "sleep 1" loop
$PBS_DAEMON -d $PBS_HOME -t create &
while [ ! -r $PBS_SERVERDB ]; do
sleep 1
done
killproc pbs_server
RET=$?
in particular the serverdb is never actually written to disk until
the pbs_server is killed so the loop lasts forever.
Better would be if the pbs_server could close the filehandle on
the serverdb or something.
Following on from the comments in the mail thread, the init.d script
should either be successful or if the serverdb file fails then
the init.d script should just fail with a message and return code
of 5 telling you what to do.... As a trivial fix changing the above
code to:
$PBS_DAEMON -d $PBS_HOME -t create &
sleep 5
killproc pbs_server
RET=$?
does the job but will not handle an error case very well.
Steve.
Steve.
--
Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.
More information about the torquedev
mailing list