Interpolation-restart strategies for resilient eigensolvers

Abstract : The solution of large eigenproblems is involved in many scientific and engineering applications when, for instance stability analysis is a concern. For large simulation in material physics or thermo-acoustics, the calculation can last for many hours on large parallel platforms. On future large-scale systems, the time interval between two consecutive faults is forecast to decrease so that many faults could occur during the solution of large eigenproblems. Consequently it becomes critical to design parallel eigensolvers which can survive faults. In that framework, we mainly investigate the relevance of approaches relying on numerical techniques that might be combined with more classical techniques for real large scale parallel implementations. Because we focus on numerical remedies we do not consider parallel implementations nor parallel experiments but only numerical experiments. We assume that a separate mechanism ensures the fault detection and that a system layer provides support for setting back the environment (processes, \ldots) in a running state. Once the system is in a running state, after a fault, our main objective is to provide robust resilient schemes so that the eigensolver may keep converging through the fault without restarting the calculation from scratch. For this purpose, we extend the interpolation-restart (IR) strategies introduced in a previous work for linear systems. For a given numerical scheme, the IR strategies consist in extracting relevant spectral information from available data after a fault. After data extraction, a well selected part of the missing data is regenerated through interpolation strategies to constitute meaningful input to restart the numerical algorithm. A main feature of this numerical remedy that it does not require extra resources, e.i., computational unit or computing time, when no fault occurs. In this paper, we revisit a few state-of-the-art methods for solving large sparse eigenvalue problems namely the Arnoldi methods, subspace iteration methods and the Jacobi-Davidson method, in the light of our IR strategies. For each considered eigensolver, we adapt the IR strategies to regenerate as much spectral information as possible. Through intensive numerical experiments, we illustrate the qualitative behavior of the resulting schemes when the number of faults and the amount of lost data are varied.
Type de document :
Rapport
[Research Report] 8625, INRIA Bordeaux. 2015
Liste complète des métadonnées


https://hal.inria.fr/hal-01201748
Contributeur : Mawussi Zounon <>
Soumis le : vendredi 2 octobre 2015 - 10:20:45
Dernière modification le : vendredi 16 septembre 2016 - 15:07:34
Document(s) archivé(s) le : dimanche 3 janvier 2016 - 10:34:37

Fichier

RR-VP.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01201748, version 2

Collections

Citation

Emmanuel Agullo, Luc Giraud, Pablo Salas, Mawussi Zounon. Interpolation-restart strategies for resilient eigensolvers. [Research Report] 8625, INRIA Bordeaux. 2015. <hal-01201748v2>

Partager

Métriques

Consultations de
la notice

364

Téléchargements du document

78