Locality Optimization on a NUMA Architecture for Hybrid LU Factorization

Adrien Rémy; Marc Baboulin; Masha Sosonkina; Brigitte Rozoy

doi:10.3233/978-1-61499-381-0-153

Book chapter. Year: 2014


Adrien Rémy

Role: Author
PersonId : 916954

Performance Optimization by Software Transformation and Algorithms & Librairies Enhancement

Systèmes parallèles (LRI)

Marc Baboulin

Role: Author
PersonId : 16585
IdHAL : marc-baboulin
IdRef : 105979163

Performance Optimization by Software Transformation and Algorithms & Librairies Enhancement

Systèmes parallèles (LRI)

Masha Sosonkina

Role: Author

Old Dominion University [Norfolk]

Brigitte Rozoy

Role: Author

Systèmes parallèles (LRI)

Abstract

We study the impact of non-uniform memory accesses (NUMA) on the solution of dense general linear systems using an LU factorization algorithm. In particular, we illustrate how an appropriate placement of threads and memory on a NUMA architecture can improve the performance of the panel factorization and consequently accelerate the global LU factorization. We apply these placement strategies and present performance results for a hybrid multicore/GPU LU algorithm as implemented in the public domain library MAGMA.
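To make the role of the panel factorization concrete, the following is a minimal sketch of a blocked LU factorization with partial pivoting in plain Python. It is an illustration only, not the MAGMA implementation and not the authors' code: the function names `panel_factor` and `blocked_lu` and the block-size parameter `nb` are hypothetical, and no NUMA placement or GPU offload is modeled. It only shows where the panel step sits relative to the update of the rest of the matrix.

```python
def panel_factor(A, piv, k, kb):
    """Factor the panel (columns k..k+kb-1, rows k..n-1) in place.

    Hypothetical helper for illustration; uses partial pivoting.
    """
    n = len(A)
    for j in range(k, k + kb):
        # Pivot: row with the largest magnitude in column j, at or below the diagonal.
        p = max(range(j, n), key=lambda i: abs(A[i][j]))
        if A[p][j] == 0.0:
            raise ValueError("matrix is singular")
        if p != j:
            A[j], A[p] = A[p], A[j]          # swap entire rows
            piv[j], piv[p] = piv[p], piv[j]  # record the permutation
        for i in range(j + 1, n):
            A[i][j] /= A[j][j]               # multiplier, stored in place of L
            # Update only the remaining columns of the current panel.
            for c in range(j + 1, k + kb):
                A[i][c] -= A[i][j] * A[j][c]


def blocked_lu(A, nb=2):
    """Return (LU, piv) such that row i of P*A is A[piv[i]] and P*A = L*U.

    L (unit lower triangular) and U are packed into one matrix.
    The input matrix is copied, not modified.
    """
    n = len(A)
    A = [row[:] for row in A]
    piv = list(range(n))
    for k in range(0, n, nb):
        kb = min(nb, n - k)
        # Step 1: panel factorization (the step the abstract focuses on).
        panel_factor(A, piv, k, kb)
        # Step 2: triangular solve, U12 = L11^{-1} * A12.
        for j in range(k, k + kb):
            for i in range(j + 1, k + kb):
                for c in range(k + kb, n):
                    A[i][c] -= A[i][j] * A[j][c]
        # Step 3: trailing matrix update, A22 -= L21 * U12.
        for i in range(k + kb, n):
            for j in range(k, k + kb):
                lij = A[i][j]
                for c in range(k + kb, n):
                    A[i][c] -= lij * A[j][c]
    return A, piv
```

Calling `blocked_lu(A, nb=2)` on a square list-of-lists matrix returns the packed factors and the row permutation. In this sketch the panel step is sequential in its columns, which is one reason its performance can influence the whole factorization.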

Keywords

LU factorization; thread placement; ccNUMA; dense linear systems; MAGMA library

Domains

Numerical Analysis [cs.NA]

Contributor: Marc Baboulin

https://inria.hal.science/hal-00987284

Submitted on: Monday, May 5, 2014, 19:17:53

Last modified on: Wednesday, February 14, 2024, 03:09:33

Dates and versions

hal-00987284, version 1 (05-05-2014)

Identifiers

HAL Id: hal-00987284, version 1
DOI: 10.3233/978-1-61499-381-0-153

Cite

Adrien Rémy, Marc Baboulin, Masha Sosonkina, Brigitte Rozoy. Locality Optimization on a NUMA Architecture for Hybrid LU Factorization. Parallel Computing: Accelerating Computational Science and Engineering, 25, pp.153-162, 2014, Advances in Parallel Computing, ⟨10.3233/978-1-61499-381-0-153⟩. ⟨hal-00987284⟩


Collections

EC-PARIS CNRS INRIA UMR8623 INRIA2 LRI-PARSYS UNIV-PARIS-SACLAY LISN LISN-PARSYS

90 views

0 downloads
