Memory Footprint of Locality Information on Many-Core Platforms

Brice Goglin 1
1 TADAAM - Topology-Aware System-Scale Data Management for High-Performance Computing
LaBRI - Laboratoire Bordelais de Recherche en Informatique, Inria Bordeaux - Sud-Ouest
Abstract : Exploiting the power of HPC platforms requires knowledge of their increasingly complex hardware topologies. Multiple components of the software stack, for instance MPI implementations or OpenMP runtimes, now perform their own topology discovery to find out the available cores and memory, and to better place tasks based on their affinities. We study in this article the impact of this topology discovery in terms of memory footprint. Storing locality information wastes an amount of physical memory that is becoming an issue on many-core platforms on the road to exascale. We demonstrate that this information may be factorized between processes by using a shared-memory region. Our implementation in hwloc and Open MPI shows a memory footprint that does not increase with the number of MPI ranks per node anymore. Moreover the job launch time is decreased by more than a factor of 2 on an Intel Knights Landing Xeon Phi and on a 96-core NUMA platform.
Liste complète des métadonnées
Contributeur : Brice Goglin <>
Soumis le : mardi 21 novembre 2017 - 23:30:22
Dernière modification le : jeudi 11 janvier 2018 - 06:27:21


  • HAL Id : hal-01644087, version 1



Brice Goglin. Memory Footprint of Locality Information on Many-Core Platforms. 2017. 〈hal-01644087〉



Consultations de la notice