Scalable Shared Memory Multiprocessors: Some Ideas to Make them Reliable

Michel Banâtre 1 Alain Gefflaut 1 Christine Morin 1
1 LSP - Langages et Systèmes Parallèles
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires
Abstract : Scalable shared memory multiprocessors are promising architectures to achieve teraflops computational power. As they contain a large number of processor and memory elements, such machines have a high probability of failure. In this paper, we investigate an approach based on backward error recovery to provide a highly available scalable shared memory architecture tolerating transient and permanent processor and memory failures.
Complete list of metadatas

https://hal.inria.fr/hal-01275700
Contributor : Christine Morin <>
Submitted on : Thursday, February 18, 2016 - 8:25:44 AM
Last modification on : Friday, November 16, 2018 - 1:26:19 AM

Identifiers

  • HAL Id : hal-01275700, version 1

Citation

Michel Banâtre, Alain Gefflaut, Christine Morin. Scalable Shared Memory Multiprocessors: Some Ideas to Make them Reliable. Proceedings of the Workshop on Hardware and Software Architectures for Fault Tolerance: Perpectives and Towards a Synthesis, 1993, Le Mont Saint-Michel, France. ⟨hal-01275700⟩

Share

Metrics

Record views

242