Optimization of an LU Factorization Routine Using Communication/Computation Overlap

Frédéric Desprez 1 Stéphane Domas 1 Bernard Tourancheau 1
1 REMAP - Regularity and massive parallel computing
Inria Grenoble - Rhône-Alpes, LIP - Laboratoire de l'Informatique du Parallélisme
Abstract : This report presents some works on the $LU$ factorization from the ScaLAPACK library. First, a complexity analysis is given. It allows to compute the optimal block size for the block scattered distribution used in ScaLAPACK. It also gives the communication phases that are interesting to overlap. Second, two optimizations based on computations/communications overlap are given with experimental results on Intel Paragon and IBM SP2 systems.
Type de document :
Rapport
[Research Report] RR-3094, INRIA. 1997
Liste complète des métadonnées

https://hal.inria.fr/inria-00073597
Contributeur : Rapport de Recherche Inria <>
Soumis le : mercredi 24 mai 2006 - 13:17:15
Dernière modification le : mardi 16 janvier 2018 - 15:43:15
Document(s) archivé(s) le : dimanche 4 avril 2010 - 21:36:24

Fichiers

Identifiants

  • HAL Id : inria-00073597, version 1

Collections

Citation

Frédéric Desprez, Stéphane Domas, Bernard Tourancheau. Optimization of an LU Factorization Routine Using Communication/Computation Overlap. [Research Report] RR-3094, INRIA. 1997. 〈inria-00073597〉

Partager

Métriques

Consultations de la notice

217

Téléchargements de fichiers

237