[torquedev] [Bug 169] New: Documentation on "Job Checkpoint and Restart" is outdated

bugzilla-daemon at supercluster.org bugzilla-daemon at supercluster.org
Thu Dec 15 11:25:59 MST 2011


http://www.clusterresources.com/bugzilla/show_bug.cgi?id=169

           Summary: Documentation on "Job Checkpoint and Restart" is
                    outdated
           Product: TORQUE
           Version: 2.5.x
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: enhancement
          Priority: P5
         Component: Documentation
        AssignedTo: knielson at adaptivecomputing.com
        ReportedBy: thzeiser at gmail.com
                CC: torquedev at supercluster.org
   Estimated Hours: 0.0


The example scripts for mom_priv/blcr_checkpoint_script and
mom_priv/blcr_restart_script given on
http://www.adaptivecomputing.com/resources/docs/torque/2-5-9/2.6jobcheckpoint.php
no longer work. E.g. there is now an additional argument $groupId" to both
scripts. Moreover, the scripts included in the directory contrib/blcr of the
sources are much more advanced as they for example include this fix but also
dropping privileges). Thus, the examples on the web page should be either
updated or substituted by a reference to the contrib/blcr directory.

However, the number and the type of the arguments given to both scripts should
be described on the web page (missing currently). Moreover, the mom_priv/config
options $checkpoint_interval, etc. are not at all listed on
http://www.adaptivecomputing.com/resources/docs/torque/2-5-9/a.cmomconfig.php.
In particular it would be worth mentioning what the default for
$checkpoint_interval is and if a value of zero e.g. means no automatic
checkpoints.

-- 
Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.


More information about the torquedev mailing list