[torquedev] [Bug 161] New: pbs_server, init.d hangs when creating serverdb.

bugzilla-daemon at supercluster.org bugzilla-daemon at supercluster.org
Sat Oct 8 16:06:22 MDT 2011


http://www.clusterresources.com/bugzilla/show_bug.cgi?id=161

           Summary: pbs_server, init.d hangs when creating serverdb.
           Product: TORQUE
           Version: 3.0.x
          Platform: PC
        OS/Version: Linux
            Status: NEW
          Severity: normal
          Priority: P5
         Component: pbs_server
        AssignedTo: dbeer at adaptivecomputing.com
        ReportedBy: steve.traylen at cern.ch
                CC: torquedev at supercluster.org
   Estimated Hours: 0.0


Hi,

Reported 
http://www.supercluster.org/pipermail/torqueusers/2011-May/012828.html
and
https://bugzilla.redhat.com/show_bug.cgi?id=744138

steps to reproduce

1. /etc/init.d/pbs_server stop
2. rm -f /var/torque/server_priv/serverdb 
3. /etc/init.d/pbs_server start

The init.d script is left in  a "sleep 1" loop 

$PBS_DAEMON -d $PBS_HOME -t create &
    while [ ! -r $PBS_SERVERDB ]; do
        sleep 1
    done
    killproc pbs_server
    RET=$?

in particular the serverdb is never actually written to disk until
the pbs_server is killed so the loop lasts forever.


Better would be if the pbs_server could close the filehandle on
the serverdb or something.

Following on from the comments in the mail thread, the init.d script
should either be successful or if the serverdb file fails then
the init.d script should just fail with a message and return code
of 5 telling you what to do.... As a trivial fix changing the above
code to:

   $PBS_DAEMON -d $PBS_HOME -t create &
   sleep 5
   killproc pbs_server
   RET=$?

does the job but will not handle an error case very well.

Steve.





Steve.

-- 
Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.


More information about the torquedev mailing list