Scheduling deadline constrained checkpointing on virtual clusters

Blaise Yenke 1 Jean-François Mehaut 2 Maurice Tchuenté 3
1 Laboratoire d'Informatique de Grenoble
LIG - Laboratoire d'Informatique de Grenoble
2 MESCAL - Middleware efficiently scalable
Inria Grenoble - Rhône-Alpes, LIG - Laboratoire d'Informatique de Grenoble
Abstract : We consider a context where the available resources of the Intranet of a company are used as a virtual cluster for scientific computation, during the idle periods (nights, weekends, holidays, ). Generally, these idle periods do not permit to carry out completely the computations. For instance, a workstation mobilized during the night must be released in the morning to make it available for the employee, even if the application running on it is not completed. It is therefore necessary to save the context of uncompleted applications for possible restart. Hereafter, we assume that the computations running on the workstations are independent from each other. The checkpointing mechanism which ensures the continuity of applications is subject to resource constraints : the network bandwidth, the disk bandwidth and the delay T imposed for releasing the workstations. We first show that the designing of a scheduling strategy which optimizes resource consumption while taking into account the above constraints, can be formalized as a variant of the classical 0/1 knapsack problem. We then propose an algorithm whose implementation does not have a significant overhead on checkpointing mechanisms. Experiments carried out on a real cluster show that this algorithm performs better than the naive scheduling algorithm which selects the applications one after the other in order of decreasing amount of resource consumption.
Complete list of metadatas

https://hal.inria.fr/hal-00953619
Contributor : Arnaud Legrand <>
Submitted on : Friday, February 28, 2014 - 2:11:41 PM
Last modification on : Tuesday, July 9, 2019 - 1:26:58 AM

Identifiers

Collections

Citation

Blaise Yenke, Jean-François Mehaut, Maurice Tchuenté. Scheduling deadline constrained checkpointing on virtual clusters. Proceedings of the IEEE Asia-Pacific Services Computing Conference (APSCC), 2008, Yilan, Taiwan. pp.257 - 264, ⟨10.1109/APSCC.2008.131⟩. ⟨hal-00953619⟩

Share

Metrics

Record views

304