# Optimization of an LU Factorization Routine Using Communication/Computation Overlap

1 REMAP - Regularity and massive parallel computing
Inria Grenoble - Rhône-Alpes, LIP - Laboratoire de l'Informatique du Parallélisme
Abstract : This report presents some works on the $LU$ factorization from the ScaLAPACK library. First, a complexity analysis is given. It allows to compute the optimal block size for the block scattered distribution used in ScaLAPACK. It also gives the communication phases that are interesting to overlap. Second, two optimizations based on computations/communications overlap are given with experimental results on Intel Paragon and IBM SP2 systems.
keyword :
Document type :
Reports
Domain :

https://hal.inria.fr/inria-00073597
Contributor : Rapport de Recherche Inria <>
Submitted on : Wednesday, May 24, 2006 - 1:17:15 PM
Last modification on : Monday, December 10, 2018 - 10:54:05 AM
Long-term archiving on : Sunday, April 4, 2010 - 9:36:24 PM

### Identifiers

• HAL Id : inria-00073597, version 1

### Citation

Frédéric Desprez, Stéphane Domas, Bernard Tourancheau. Optimization of an LU Factorization Routine Using Communication/Computation Overlap. [Research Report] RR-3094, INRIA. 1997. ⟨inria-00073597⟩

Record views