On the Overhead of Topology Discovery for Locality-aware Scheduling in HPC

Brice Goglin 1, 2
Abstract : The increasing complexity of parallel computing platforms requires a deep knowledge of the hardware and of the application needs. Locality a key criteria for performance optimization. It involves software tools to expose information about the hardware topology to high performance runtime libraries. We show that the overhead of gathering such information from the operating system is significant on large computing nodes that run Linux. This overhead also increases more than linearly with the number of processes that perform it simultaneously. We then study the actual needs of the HPC software ecosystem in terms of topology information. We propose some ways to avoid multiple expensive topology discovery and to share topology information between components such as the resource manager or the runtime libraries.
Type de document :
Communication dans un congrès
Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP2017), Mar 2017, St Petersburg, Russia. IEEE Computer Society, pp.9, 2017, Proceedings of the 25th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP2017). <http://pdp2017.org/>
Liste complète des métadonnées


https://hal.inria.fr/hal-01402755
Contributeur : Brice Goglin <>
Soumis le : samedi 3 décembre 2016 - 21:37:08
Dernière modification le : jeudi 15 juin 2017 - 09:08:50
Document(s) archivé(s) le : jeudi 23 mars 2017 - 00:35:04

Fichier

article.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01402755, version 2

Collections

Citation

Brice Goglin. On the Overhead of Topology Discovery for Locality-aware Scheduling in HPC. Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP2017), Mar 2017, St Petersburg, Russia. IEEE Computer Society, pp.9, 2017, Proceedings of the 25th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP2017). <http://pdp2017.org/>. <hal-01402755v2>

Partager

Métriques

Consultations de
la notice

405

Téléchargements du document

98