[torquedev] Multi-process checkpointing
Christopher Samuel
samuel at unimelb.edu.au
Thu Jul 1 21:50:03 MDT 2010
On 29/06/10 23:36, Danny Sternkopf wrote:
> Please look at ./src/server/req_runjob.c line 1429:
> if (strcmp(prun->rq_destin, exec_host) != 0)
>
> For a job submitted with -lnodes=1:ppn=1, this comparison gives:
>
> prun->rq_destin: htx5 vs. exec_host: htx5
> -> which is okay. Hosts are the same, no failure.
>
> But for a job submitted with -lnodes=1:ppn=2, the same comparison gives:
>
> prun->rq_destin: htx5:ppn=2 vs. exec_host: htx5
> -> which is not the same and gives a failure.
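
To illustrate the mismatch described above, here is a minimal sketch in C. It is not the actual TORQUE code and not a proposed patch: host_matches() is a hypothetical helper, and it assumes that exec_host is a bare host name (as in the values quoted above) and that anything after the first ':' in the destination is a resource suffix such as ppn=2.

```c
#include <string.h>

/*
 * Hypothetical helper, not part of TORQUE.
 * Compare only the host-name portion of a destination such as
 * "htx5:ppn=2" against exec_host, ignoring everything from the
 * first ':' onwards. Returns 1 on a match, 0 otherwise.
 */
static int host_matches(const char *destin, const char *exec_host)
{
    /* Length of destin up to the first ':' (or the whole string). */
    size_t host_len = strcspn(destin, ":");

    /* Require equal lengths so that "htx50" does not match "htx5". */
    return strlen(exec_host) == host_len &&
           strncmp(destin, exec_host, host_len) == 0;
}
```

With this sketch, host_matches("htx5:ppn=2", "htx5") and host_matches("htx5", "htx5") both return 1, whereas the plain strcmp() in the quoted line reports the first pair as different.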
Sounds like an excellent candidate for a Bugzilla entry to me!
http://www.clusterresources.com/bugzilla/
cheers!
Chris
--
Christopher Samuel - Senior Systems Administrator
VLSCI - Victorian Life Sciences Computational Initiative
Email: samuel at unimelb.edu.au Phone: +61 (0)3 903 55545
http://www.vlsci.unimelb.edu.au/