Robustness of the Young/Daly formula for stochastic iterative applications - Archive ouverte HAL Access content directly
Conference Papers Year :

## Robustness of the Young/Daly formula for stochastic iterative applications

(1, 2) , (1, 2) , (3) , (1, 2)
1
2
3
Yishu Du
• Function : Author
• PersonId : 1082929
Loris Marchal
Guillaume Pallez
Yves Robert

#### Abstract

The Young/Daly formula for periodic checkpointing is known to hold for a divisible load application where one can checkpoint at any time-step. In an nutshell, the optimal period is $P YD = 2µ f C$ where µ f is the Mean Time Between Failures (MTBF) and C is the checkpoint time. This paper assesses the accuracy of the formula for applications decomposed into computational iterations where: (i) the duration of an iteration is stochastic, i.e., obeys a probability distribution law D of mean µ D ; and (ii) one can checkpoint only at the end of an iteration. We first consider static strategies where checkpoints are taken after a given number of iterations k and provide a closed-form, asymptotically optimal, formula for k, valid for any distribution D. We then show that using the Young/Daly formula to compute $k (as k • µ D = P YD)$ is a first order approximation of this formula. We also consider dynamic strategies where one decides to checkpoint at the end of an iteration only if the total amount of work since the last checkpoint exceeds a threshold W th , and otherwise proceed to the next iteration. Similarly, we provide a closed-form formula for this threshold and show that P YD is a first-order approximation of W th. Finally, we provide an extensive set of simulations where D is either Uniform, Gamma or truncated Normal, which shows the global accuracy of the Young/Daly formula, even when the distribution D had a large standard deviation (and when one cannot use a first-order approximation). Hence we establish that the relevance of the formula goes well beyond its original framework.

#### Domains

Computer Science [cs]

### Dates and versions

hal-03024618 , version 1 (30-11-2020)

### Identifiers

• HAL Id : hal-03024618 , version 1
• DOI :

### Cite

Yishu Du, Loris Marchal, Guillaume Pallez, Yves Robert. Robustness of the Young/Daly formula for stochastic iterative applications. ICPP 2020 - 49th International Conference on Parallel Processing, Aug 2020, Edmonton / Virtual, Canada. pp.1-11, ⟨10.1145/3404397.3404418⟩. ⟨hal-03024618⟩

### Export

BibTeX TEI Dublin Core DC Terms EndNote Datacite

55 View