[torquedev] torque+blcr+openmpi

Peter Kruse pk at q-leap.com
Tue Jul 6 03:54:42 MDT 2010


Hi Rishi,

rishi pathak wrote:
> Hi Danny,
> Is there a need to checkpoint the mpirun/mpiexec processes at all? (Please
> correct me if I am wrong.) They spawn the MPI program on the
> defined nodes. To restart a checkpointed MPI program, a fresh instance
> of mpirun, mpiexec or pbsdsh can be used.

Exactly, this is how we see it and how we use it.  You submit a new job
with the same node geometry and can then restart the checkpointed job in it.
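
To illustrate what "same node geometry" means in practice, here is a
minimal sketch in Python that builds a qsub command line requesting the
same nodes/ppn layout as the original job.  The helper name, the script
name restart.sh and the 4x8 geometry are hypothetical; restart.sh is
assumed to hold whatever restart logic your site uses (for example a
fresh mpirun), and none of this is taken from an actual setup.

```python
import shlex


def restart_job_command(nodes, ppn, restart_script):
    """Build a qsub command requesting the original job's node geometry.

    nodes and ppn must match the values the checkpointed job ran with;
    restart_script is a hypothetical script that performs the restart.
    """
    if nodes < 1 or ppn < 1:
        raise ValueError("nodes and ppn must be positive integers")
    # Same resource request as the original submission, e.g. nodes=4:ppn=8
    resource = f"nodes={nodes}:ppn={ppn}"
    return ["qsub", "-l", resource, restart_script]


if __name__ == "__main__":
    # Assumed geometry of the original job: 4 nodes, 8 processes per node
    cmd = restart_job_command(4, 8, "restart.sh")
    print(shlex.join(cmd))
```

The command is only printed here, not executed, so it can be checked
before it is handed to the batch system.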

Regards,

Peter

