[torqueusers] Checkpointing and restart with torque 2.4 with BLCR
rajiv.care at gmail.com
Mon Mar 29 23:16:17 MDT 2010
I have tried blcr-0.7.3-18721 with torque 2.4.2 and followed the installation
steps given in the Torque administration manual.
I have configured everything and added the configuration scripts for
checkpointing. I installed both BLCR and Torque as the root user. While testing
the checkpoint facility with qhold, the job remains in the running state. I am
getting the following error messages in /var/log/messages:
Mar 30 10:42:12 gcluster checkpoint_script: Invoked: /var/spool/torque/mom_priv/blcr_checkpoint_script 24472 0.gcluster.grid
Mar 30 10:42:12 gcluster checkpoint_script: Usage:
Mar 30 10:42:12 gcluster pbs_mom: LOG_ERROR::blcr_checkpoint_job, checkpoint script returned value 255
Please help me solve this issue, or could you share the exact document you
followed to configure the checkpointing facility?
On Mon, Mar 29, 2010 at 6:30 PM, Alexander Oltu <Alexander.Oltu at uni.no> wrote:
> We have checkpointing working with blcr-0.7.3-18721 and torque 2.4.5.
> On Sat, 27 Mar 2010 16:22:13 +0530
> Rajiv Rajaian wrote:
> > Hi all,
> > Has anybody tried the checkpointing feature of torque 2.4 with
> > BLCR?
> > If so, please let me know the exact versions of torque and BLCR that
> > support the checkpointing feature.
> > Thanks in advance
> > Rajiv. R
> > Project Associate,
> > CARE, MIT,
> > Anna University
> Alexander Oltu
> System Engineer
> Parallab, Uni BCCS
> N-5008 Bergen, Norway
> phone: +47 55584144