Optimization of an LU Factorization Routine Using Communication/Computation Overlap

Frédéric Desprez 1 Stéphane Domas 1 Bernard Tourancheau 1
1 REMAP - Regularity and massive parallel computing
Inria Grenoble - Rhône-Alpes, LIP - Laboratoire de l'Informatique du Parallélisme
Abstract : This report presents some works on the $LU$ factorization from the ScaLAPACK library. First, a complexity analysis is given. It allows to compute the optimal block size for the block scattered distribution used in ScaLAPACK. It also gives the communication phases that are interesting to overlap. Second, two optimizations based on computations/communications overlap are given with experimental results on Intel Paragon and IBM SP2 systems.
Document type :
Reports
Complete list of metadatas

https://hal.inria.fr/inria-00073597
Contributor : Rapport de Recherche Inria <>
Submitted on : Wednesday, May 24, 2006 - 1:17:15 PM
Last modification on : Monday, December 10, 2018 - 10:54:05 AM
Long-term archiving on : Sunday, April 4, 2010 - 9:36:24 PM

Identifiers

  • HAL Id : inria-00073597, version 1

Collections

Citation

Frédéric Desprez, Stéphane Domas, Bernard Tourancheau. Optimization of an LU Factorization Routine Using Communication/Computation Overlap. [Research Report] RR-3094, INRIA. 1997. ⟨inria-00073597⟩

Share

Metrics

Record views

250

Files downloads

413