Saline: Improving Best-Effort Job Management in Grids

Jérôme Gallard 1 Adrien Lèbre 2, 3 Christine Morin 1
1 MYRIADS - Design and Implementation of Autonomous Distributed Systems
IRISA-D1 - SYSTÈMES LARGE ÉCHELLE, Inria Rennes – Bretagne Atlantique
2 ASCOLA - Aspect and composition languages
LINA - Laboratoire d'Informatique de Nantes Atlantique, Département informatique - EMN, Inria Rennes – Bretagne Atlantique
Abstract : Although virtualization technologies have recently gained a lot of interest in Grid computing as they allow flexible resource management, the most common way to exploit grids still relies on dedicated services like resource management systems (RMSs) to get resources at a particular time. To improve resource usage, most of these systems provide a best-effort mode where lowest priority jobs can be executed when resources are idle. This particular mode does not provide any guarantee of service and jobs may be killed at any time by the RMS when the nodes they use are subject to higher priority reservations. This behaviour potentially leads to a huge waste of computation time or at least requires users to deal with checkpoints of their best-effort jobs. In this paper, we present Saline, a generic and non-intrusive framework to manage best-effort jobs at grid level through virtual machines (VMs) usage. We discuss the main challenges concerning the design of such a grid system, focusing on VM snapshot management and network configuration. Results of preliminary experiments show the interest of our proposal to ensure an efficient execution of best-effort jobs through the whole grid.
Type de document :
Communication dans un congrès
PDP 2010: The 18th Euromicro International Conference on Parallel, Distributed and Network-Based Computing -- Special Session: Virtualization in Distributed Systems, 2010, Pisa, Italy. 2010, 〈http://doi.ieeecomputersociety.org/10.1109/PDP.2010.61〉
Liste complète des métadonnées

https://hal.inria.fr/inria-00426373
Contributeur : Jérôme Gallard <>
Soumis le : dimanche 25 octobre 2009 - 18:01:05
Dernière modification le : mardi 16 janvier 2018 - 15:54:19

Identifiants

  • HAL Id : inria-00426373, version 1

Citation

Jérôme Gallard, Adrien Lèbre, Christine Morin. Saline: Improving Best-Effort Job Management in Grids. PDP 2010: The 18th Euromicro International Conference on Parallel, Distributed and Network-Based Computing -- Special Session: Virtualization in Distributed Systems, 2010, Pisa, Italy. 2010, 〈http://doi.ieeecomputersociety.org/10.1109/PDP.2010.61〉. 〈inria-00426373〉

Partager

Métriques

Consultations de la notice

514