Where can I find the fault tolerance development work?
The end-to-end MPI message data reliability work (i.e., reliable message passing over unreliable networks) is being actively developed on the Subversion trunk. See this FAQ entry for more details.
The coordinated checkpoint and restart process fault tolerance work is currently available on the Open MPI development trunk and in the v1.3 release series. For more information about how to use this feature, see this wiki page: https://svn.open-mpi.org/trac/ompi/wiki/ProcessFT_CR.
5. Does Open MPI support end-to-end data reliability in MPI message passing?
The current release of Open MPI does not support end-to-end data reliability in message passing any more than the underlying network already guarantees. Future releases of Open MPI will include explicit data reliability support (i.e., more functionality than is provided by the underlying network). Specifically, the data reliability (“dr”) PML component (available on the trunk, but not yet in a stable release) assumes that the underlying network is unreliable. It can drop