VMdeploy: Improving Best-Effort Job Management in Grid'5000

Jérôme Gallard 1, * Adrien Lèbre 2, 3 Oana Goga 4 Christine Morin 1
* Auteur correspondant
1 PARIS - Programming distributed parallel systems for large scale numerical simulation
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, ENS Cachan - École normale supérieure - Cachan, Inria Rennes – Bretagne Atlantique
4 RESO - Protocols and softwares for very high-performance network
Inria Grenoble - Rhône-Alpes, LIP - Laboratoire de l'Informatique du Parallélisme
Abstract : Virtualization technologies have recently gained a lot of interest in Grid computing as they allow flexible resource management. Grid'5000 (G5K) is a French national Grid platform used for computer science research to experiment all layers of Grid software. Computer scientists reserve G5K nodes prior to their experiments. In G5K some low priority jobs are executed in best effort mode on the node idle time slots when the latter are not part of any reservation. However, best-effort jobs may be killed at any time by the Grid job scheduler when the nodes they use are subject to higher priority reservation. This behaviour leads potentially to a huge waste of compute time or at least requires users to deal with checkpoints of their best efforts jobs. In this paper, we describe the design and implementation of the VMdeploy framework which exploits virtual machines for executing best effort jobs in order to solve the best-effort issue in G5K platform. VMdeploy manages snapshots of the best effort jobs transparently to their users and thus ensures the progress of these jobs avoiding most of the waste of resources. Results of a preliminary experimental evaluation are presented. While designed in the context of G5K, VMdeploy can be used in combination of any job scheduler in clusters and grids.
Type de document :
Rapport
[Research Report] RR-6764, INRIA. 2008
Liste complète des métadonnées

Littérature citée [26 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00346740
Contributeur : Jérôme Gallard <>
Soumis le : lundi 6 juillet 2009 - 11:27:38
Dernière modification le : mercredi 11 juillet 2018 - 07:48:36
Document(s) archivé(s) le : samedi 26 novembre 2016 - 10:34:42

Fichier

RR-6764.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : inria-00346740, version 3

Citation

Jérôme Gallard, Adrien Lèbre, Oana Goga, Christine Morin. VMdeploy: Improving Best-Effort Job Management in Grid'5000. [Research Report] RR-6764, INRIA. 2008. 〈inria-00346740v3〉

Partager

Métriques

Consultations de la notice

1284

Téléchargements de fichiers

356