[torquedev] Multi-process checkpointing

Christopher Samuel samuel at unimelb.edu.au
Thu Jul 1 21:50:03 MDT 2010


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On 29/06/10 23:36, Danny Sternkopf wrote:

> Please look at ./src/server/req_runjob.c line 1429:
> if (strcmp(prun->rq_destin, exec_host) != 0)
> 
> This comparison gives for a job submitted with -lnodes=1:ppn=1:
> 
> prun->rq_destin: htx5 vs. exec_host: htx5
> -> which is okay. Hosts are the same, no failure.
> 
> But the same comparison gives for a job submitted with -lnodes=1:ppn=2:
> 
> prun->rq_destin: htx5:ppn=2 vs. exec_host: htx5
> -> which is not the same and gives a failure.
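
To illustrate the mismatch described above, here is a minimal sketch of a comparison that looks only at the host name and ignores anything from the first ':' onward (such as ":ppn=2"). The helper name hostname_differs is hypothetical and is not part of TORQUE, and the sketch assumes the only difference between the two strings is such a ':' suffix. It is not a tested patch for req_runjob.c.

```c
#include <string.h>

/*
 * Hypothetical helper (not part of TORQUE): compare two host specs by
 * host name only. Everything from the first ':' onward, e.g. ":ppn=2",
 * is ignored. Returns 0 when the host names match, 1 otherwise.
 */
static int hostname_differs(const char *a, const char *b)
{
    /* Length of each string up to, but not including, the first ':' */
    size_t len_a = strcspn(a, ":");
    size_t len_b = strcspn(b, ":");

    /* Different name lengths cannot be the same host (htx5 vs. htx50) */
    if (len_a != len_b)
        return 1;

    return strncmp(a, b, len_a) != 0;
}
```

With this helper, "htx5:ppn=2" and "htx5" compare as the same host, while "htx5" and "htx50" still compare as different. Whether stripping the suffix is the right fix, or whether rq_destin should never carry the ppn suffix in the first place, is a question for the bug report.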

Sounds like an excellent candidate for a Bugzilla entry to me!

http://www.clusterresources.com/bugzilla/

cheers!
Chris
- -- 
 Christopher Samuel - Senior Systems Administrator
 VLSCI - Victorian Life Sciences Computational Initiative
 Email: samuel at unimelb.edu.au Phone: +61 (0)3 903 55545
         http://www.vlsci.unimelb.edu.au/

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.10 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/

iEYEARECAAYFAkwtYesACgkQO2KABBYQAh82nQCfemQcX2FT9knoixOEtGGQsQKR
YHoAn3cZ/qmrevv24bCr2RbJxmpaE0iP
=NMT/
-----END PGP SIGNATURE-----