[torqueusers] need help with 1.2.0p1 snapshot testing

Garrick Staples garrick at usc.edu
Mon Feb 14 23:29:09 MST 2005


On Tue, Feb 15, 2005 at 01:41:40PM +1100, Chris Samuel alleged:
> On Tue, 15 Feb 2005 12:13 pm, Garrick Staples wrote:
> 
> Hi Garrick,
> 
> First off, thanks so much for all this work!
> 
> > The latest snapshot, 1.2.0p1-snap.1108426118, has folded in some big
> > changes. I've been banging on these patches for a few weeks and everything
> > seems pretty solid to me, but it could really use some wider testing.
> 
> A couple of quick questions:
> 
> 1) Will this inherit queued jobs from earlier Torque releases ?
> 2) What about running jobs ?
> 
> I only ask as I remember you mentioning something about changes to the job 
> files to allow you to do the mpiexec restart stuff...

Queued, yes.  Running, no.  Earlier versions don't save the necessary info to
properly preserve the tm state.  The fixes in the new code have as much to do
with _saving_ as _recovery_.

-- 
Garrick Staples, Linux/HPCC Administrator
University of Southern California
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
Url : http://www.supercluster.org/pipermail/torqueusers/attachments/20050214/4af2eee3/attachment.bin


More information about the torqueusers mailing list