Kadeploy3: Efficient and Scalable Operating System Provisioning for HPC Clusters - Archive ouverte HAL Access content directly
Reports (Research Report) Year : 2012

Kadeploy3: Efficient and Scalable Operating System Provisioning for HPC Clusters

(1) , (1) , (1)
1
Emmanuel Jeanvoine
  • Function : Author
  • PersonId : 847304
Luc Sarzyniec
  • Function : Author
  • PersonId : 925508
Lucas Nussbaum

Abstract

Operating system provisioning is a common and critical task in cluster computing environments. The required low-level operations involved in provisioning can drastically decrease the performance of a given solution, and maintaining a reasonable provisioning time on clusters of 1000+ nodes is a significant challenge. We present Kadeploy3, a tool built to efficiently and reliably deploy a large number of cluster nodes. Since it is a keystone of the Grid'5000 experimental testbed, it has been designed not only to help system administrators install and manage clusters but also to provide testbed users with a flexible way to deploy their own operating systems on nodes for their own experimentation needs, on a very frequent basis. In this paper we detail the design principles of Kadeploy3 and its main features, and evaluate its capabilities in several contexts. We also share the lessons we have learned during the design and deployment of Kadeploy3 in the hope that this will help system administrators and developers of similar solutions.
Fichier principal
Vignette du fichier
RR-8002.pdf (1010.46 Ko) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

hal-00710638 , version 1 (21-06-2012)

Identifiers

  • HAL Id : hal-00710638 , version 1

Cite

Emmanuel Jeanvoine, Luc Sarzyniec, Lucas Nussbaum. Kadeploy3: Efficient and Scalable Operating System Provisioning for HPC Clusters. [Research Report] RR-8002, INRIA. 2012. ⟨hal-00710638⟩
595 View
499 Download

Share

Gmail Facebook Twitter LinkedIn More