Global Resource Management for High Availability and Performance in a DSM-based Cluster

Christine Morin 1 Renaud Lottiaux 1
1 CAPS - Compilation, parallel architectures and system
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : High availability and performance are two desirable properties for the execution of long-running parallel scientific applications on software DSM based clusters. Global resource management in the operating system is a way to achieve these properties. To illustrate this approach, a system integrating a paged-based shared virtual memory and a parallel file system for global management of memory and disk resources is presented. Main design issues include the optimization of disk accesses in the context of a single level storage system and fault tolerance.
Type de document :
Rapport
[Research Report] RR-3694, INRIA. 1999
Liste complète des métadonnées

https://hal.inria.fr/inria-00072975
Contributeur : Rapport de Recherche Inria <>
Soumis le : mercredi 24 mai 2006 - 11:29:53
Dernière modification le : mercredi 11 avril 2018 - 01:51:01
Document(s) archivé(s) le : dimanche 4 avril 2010 - 21:11:50

Fichiers

Identifiants

  • HAL Id : inria-00072975, version 1

Citation

Christine Morin, Renaud Lottiaux. Global Resource Management for High Availability and Performance in a DSM-based Cluster. [Research Report] RR-3694, INRIA. 1999. 〈inria-00072975〉

Partager

Métriques

Consultations de la notice

233

Téléchargements de fichiers

101