Resilience at extreme scale: system level, algorithmic level or both?

Luc Giraud; Franck Cappello

Conference paper, Year: 2013

Luc Giraud

Role: Author
PersonId: 8816
IdHAL: luc-giraud
ORCID: 0000-0002-7062-7672
IdRef: 074267418

High-End Parallel Algorithms for Challenging Numerical Simulations

Laboratoire Bordelais de Recherche en Informatique

Franck Cappello

Role: Author
PersonId: 828491

Global parallel and distributed computing

Joint Laboratory for Petascale Computing [Illinois]

Laboratoire de Recherche en Informatique

Abstract

Resilience is a critical problem for extreme-scale numerical simulations. The most credible solution still relies on checkpoint/restart, with its high overhead or hardware cost. It has recently been shown that some algorithmic approaches and some code characteristics can help reduce these costs through combined system-level and algorithmic/application-level approaches. However, one simple question remains open: how can state-saving and recovery times be reduced simultaneously and significantly?

Domains

Modeling and simulation

https://inria.hal.science/hal-00799309

Submitted on: Thursday, July 25, 2013, 12:57:13

Last modified on: Wednesday, March 20, 2024, 17:52:16

Dates and versions

hal-00799309, version 1 (25-07-2013)

Identifiers

HAL Id: hal-00799309, version 1

Cite

Luc Giraud, Franck Cappello. Resilience at extreme scale: system level, algorithmic level or both? SIAM Conference on Computational Science and Engineering (SIAM CSE 2013), Feb 2013, Boston, United States. ⟨hal-00799309⟩

Collections

EC-PARIS UNIV-LILLE3 CNRS INRIA UMR8623 INRIA2 TDS-MACS UNIV-PARIS-SACLAY
