Abstract : Checkpointing is a technique that is often employed for granting fault tolerance for applications executing in failure-prone environments. It consists on regularly saving the application’s state in another and fault independent storage such that if the application fails, it can be continued without necessarily restarting it. In this context, fixing the checkpointing frequency is an important topic which we address in this paper. We particularly address this issue considering hybrid fault tolerance and variable size checkpoint dumps. We then evaluate our solution and compare it with state of the art models, and show that our solution brings better results.
https://hal.inria.fr/hal-01789964 Contributor : Hal IfipConnect in order to contact the contributor Submitted on : Friday, May 11, 2018 - 3:11:20 PM Last modification on : Friday, May 11, 2018 - 3:13:29 PM Long-term archiving on: : Tuesday, September 25, 2018 - 5:09:23 PM
Samy Sadi, Belabbas yagoubi. On the Optimum Checkpointing Interval Selection for Variable Size Checkpoint Dumps. 5th International Conference on Computer Science and Its Applications (CIIA), May 2015, Saida, Algeria. pp.599-610, ⟨10.1007/978-3-319-19578-0_49⟩. ⟨hal-01789964⟩