Kadeploy3: Efficient and Scalable Operating System Provisioning for HPC Clusters

Emmanuel Jeanvoine 1 Luc Sarzyniec 1 Lucas Nussbaum 1
1 ALGORILLE - Algorithms for the Grid
Inria Nancy - Grand Est, LORIA - NSS - Department of Networks, Systems and Services
Abstract : Operating system provisioning is a common and critical task in cluster computing environments. The required low-level operations involved in provisioning can drastically decrease the performance of a given solution, and maintaining a reasonable provisioning time on clusters of 1000+ nodes is a significant challenge. We present Kadeploy3, a tool built to efficiently and reliably deploy a large number of cluster nodes. Since it is a keystone of the Grid'5000 experimental testbed, it has been designed not only to help system administrators install and manage clusters but also to provide testbed users with a flexible way to deploy their own operating systems on nodes for their own experimentation needs, on a very frequent basis. In this paper we detail the design principles of Kadeploy3 and its main features, and evaluate its capabilities in several contexts. We also share the lessons we have learned during the design and deployment of Kadeploy3 in the hope that this will help system administrators and developers of similar solutions.
Type de document :
[Research Report] RR-8002, INRIA. 2012
Liste complète des métadonnées

Littérature citée [3 références]  Voir  Masquer  Télécharger

Contributeur : Lucas Nussbaum <>
Soumis le : jeudi 21 juin 2012 - 12:05:45
Dernière modification le : mardi 18 décembre 2018 - 16:26:02
Document(s) archivé(s) le : jeudi 15 décembre 2016 - 17:26:42


Fichiers produits par l'(les) auteur(s)


  • HAL Id : hal-00710638, version 1


Emmanuel Jeanvoine, Luc Sarzyniec, Lucas Nussbaum. Kadeploy3: Efficient and Scalable Operating System Provisioning for HPC Clusters. [Research Report] RR-8002, INRIA. 2012. 〈hal-00710638〉



Consultations de la notice


Téléchargements de fichiers