Ghost Process: a Sound Basis to Implement Process Duplication, Migration and Checkpoint/Restart in Linux Clusters

Geoffroy Vallée (1), Renaud Lottiaux (1), David Margery (1), Christine Morin (1), Jean-Yves Berthou (2)
(1) PARIS - Programming distributed parallel systems for large scale numerical simulation
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, ENS Cachan - École normale supérieure - Cachan, Inria Rennes – Bretagne Atlantique
Abstract : Today, clusters are widely used to execute numerical applications. Mechanisms are needed to ease cluster use and to take advantage of the cluster's distributed resources. Process management mechanisms are very useful in this respect: process duplication is needed for parallel application deployment, dynamic load balancing relies on process migration, and process checkpoint/restart is needed to tolerate node failures during the execution of long-running applications. A kernel-level approach allows all these mechanisms to be implemented efficiently. However, they are very complex to implement and to maintain. Nevertheless, they are all based on a common concept: process virtualization, which provides a means to extract the state of a process from the operating system running on each cluster node. This paper presents a process virtualization mechanism called ghost process, implemented at kernel level, which system programmers can use to easily implement various process management mechanisms. The ghost process mechanism has been implemented in Kerrighed, a single system image cluster operating system based on Linux. It has been used to efficiently and easily implement process duplication, migration and checkpoint/restart in Kerrighed.
Document type :
Reports

https://hal.inria.fr/inria-00070496
Contributor : Rapport de Recherche Inria
Submitted on : Friday, May 19, 2006 - 8:39:51 PM
Last modification on : Friday, November 16, 2018 - 1:30:11 AM
Long-term archiving on : Sunday, April 4, 2010 - 9:21:04 PM

Identifiers

  • HAL Id : inria-00070496, version 1

Citation

Geoffroy Vallée, Renaud Lottiaux, David Margery, Christine Morin, Jean-Yves Berthou. Ghost Process: a Sound Basis to Implement Process Duplication, Migration and Checkpoint/Restart in Linux Clusters. [Research Report] RR-5510, INRIA. 2005, pp.16. ⟨inria-00070496⟩
