[mvapich-discuss] support fault tolerance

Raghunath rajachan at cse.ohio-state.edu
Mon Feb 27 22:43:36 EST 2012


Hi Alexandr,

Thank you for posting your question to the list.
Using the existing BLCR-assisted checkpointing mechanism supported by
MVAPICH2, you should be able to save checkpoints
of your MPI-2 standard based application that uses MPI_GET() and MPI_PUT()
operations, and use these checkpoints
to restart your application at a later point. Do let us know if you run
into any issues while trying to use the
checkpoint/restart system.

Thanks,
--
Raghu


2012/2/27 Александр Твеленев <santvel at mail.ru>

> Hello group.
> I want to install MVAPICH2 and BLCR for fault tolerance support.
> Whether there will be checkpoints and to happen recovery from them at
> usage of one-sided communications such as "MPI_GET ()" and "MPI_PUT ()"?
> Are there any other ways to provide fault tolerance for MPI-2?
>
> Kind regards, Alexandr.
>
> _______________________________________________
> mvapich-discuss mailing list
> mvapich-discuss at cse.ohio-state.edu
> http://mail.cse.ohio-state.edu/mailman/listinfo/mvapich-discuss
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.cse.ohio-state.edu/pipermail/mvapich-discuss/attachments/20120227/fb31328a/attachment.html


More information about the mvapich-discuss mailing list