Topology and affinity aware hierarchical and distributed load-balancing in Charm++

Abstract : The evolution of massively parallel supercomputers make palpable two issues in particular: the load imbalance and the poor management of data locality in applications. Thus, with the increase of the number of cores and the drastic decrease of amount of memory per core, the large performance needs imply to particularly take care of the load-balancing and as much as possible of the locality of data. One mean to take into account this locality issue relies on the placement of the processing entities and load balancing techniques are relevant in order to improve application performance. With large-scale platforms in mind, we developed a hierarchical and distributed algorithm which aim is to perform a topology-aware load balancing tailored for Charm++ applications. This algorithm is based on both LibTopoMap for the network awareness aspects and on TREEMATCH to determine a relevant placement of the processing entities. We show that the proposed algorithm improves the overall execution time in both the cases of real applications and a synthetic benchmark as well. For this last experiment, we show a scalability up to one millions processing entities.
Type de document :
Communication dans un congrès
1st Workshop on Optimization of Communication in HPC runtime systems (IEEE COM-HPC16), Nov 2016, Salt-Lake City, United States
Liste complète des métadonnées

Littérature citée [30 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01394748
Contributeur : Emmanuel Jeannot <>
Soumis le : lundi 14 novembre 2016 - 11:23:18
Dernière modification le : jeudi 11 janvier 2018 - 06:27:21
Document(s) archivé(s) le : mercredi 15 mars 2017 - 04:23:55

Fichier

tmlb-paper.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01394748, version 1

Collections

Citation

Emmanuel Jeannot, Guillaume Mercier, François Tessier. Topology and affinity aware hierarchical and distributed load-balancing in Charm++. 1st Workshop on Optimization of Communication in HPC runtime systems (IEEE COM-HPC16), Nov 2016, Salt-Lake City, United States. 〈hal-01394748〉

Partager

Métriques

Consultations de la notice

142

Téléchargements de fichiers

103