Skip to Main content Skip to Navigation
New interface
Reports (Research report)

Impact of fault prediction on checkpointing strategies

Guillaume Aupy 1, 2 Yves Robert 1, 2 Frédéric Vivien 1, 2 Dounia Zaidouni 2, 1 
Abstract : This paper deals with the impact of fault prediction techniques on checkpointing strategies. We extend the classical analysis of Young and Daly in the presence of a fault prediction system, which is characterized by its recall and its precision, and which provides either exact or window-based time predictions. We succeed in deriving the optimal value of the checkpointing period (thereby minimizing the waste of resource usage due to checkpoint overhead) in all scenarios. These results allow to analytically assess the key parameters that impact the performance of fault predictors at very large scale. In addition, the results of this analytical evaluation are nicely corroborated by a comprehensive set of simulations, thereby demonstrating the validity of the model and the accuracy of the results.
Complete list of metadata

Cited literature [14 references]  Display  Hide  Download
Contributor : Guillaume Pallez (Aupy) Connect in order to contact the contributor
Submitted on : Monday, October 8, 2012 - 4:00:38 PM
Last modification on : Wednesday, October 26, 2022 - 8:16:25 AM
Long-term archiving on: : Friday, December 16, 2016 - 9:49:28 PM


Files produced by the author(s)


  • HAL Id : hal-00720401, version 2


Guillaume Aupy, Yves Robert, Frédéric Vivien, Dounia Zaidouni. Impact of fault prediction on checkpointing strategies. [Research Report] RR-8023, INRIA. 2012. ⟨hal-00720401v2⟩



Record views


Files downloads