A New Approach to Configurable Dynamic Scheduling in Clusters based on Single System Image Technologies

Geoffroy Vallée 1 Christine Morin 1 Jean-Yves Berthou 2 Louis Rilling 1
1 PARIS - Programming distributed parallel systems for large scale numerical simulation
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, ENS Cachan - École normale supérieure - Cachan, Inria Rennes – Bretagne Atlantique
Abstract : Clusters are now considered as an alternative to parallel machines to execute workloads made up of sequential and/or parallel applications. For efficient application execution on clusters, dynamic global process scheduling is of prime importance. Different dynamic scheduling policies that have been studied for distributed systems or parallel machines may be used in clusters. The choice of a particular policy depends on the kind of workload to be executed. In a cluster, it is thus highly desirable to implement a configurable global scheduler to be able to adapt the dynamic scheduling policy to the workload characteristics, to take benefit of all cluster resources and tocope with node shutdown and reboot. In this paper, we present the architecture of the global scheduler and the process management mechanisms of Kerrighed, a single system image operating system designed for high performance computing on clusters. Kerrighed provides a development framework allowing to easily implement dynamic scheduling policies without kernel modification. In Kerrighed, the global scheduling policy can be dynamically changed while applications execute on the cluster. Kerrighed's process management mechanisms allow to easily deploy parallelapplications in the cluster and to efficiently migrate or checkpoint processes, including processes sharing memory. Kerrighed has been implemented as a set of modules extending Linux kernel. Preliminary performance results are presented.
Document type :
Reports
Complete list of metadatas

https://hal.inria.fr/inria-00071785
Contributor : Rapport de Recherche Inria <>
Submitted on : Tuesday, May 23, 2006 - 6:46:32 PM
Last modification on : Friday, November 16, 2018 - 1:23:01 AM
Long-term archiving on : Sunday, April 4, 2010 - 10:37:51 PM

Identifiers

  • HAL Id : inria-00071785, version 1

Citation

Geoffroy Vallée, Christine Morin, Jean-Yves Berthou, Louis Rilling. A New Approach to Configurable Dynamic Scheduling in Clusters based on Single System Image Technologies. [Research Report] RR-4801, INRIA. 2003. ⟨inria-00071785⟩

Share

Metrics

Record views

531

Files downloads

276