Modeling Large Compute Nodes with Heterogeneous Memories with Cache-Aware Roofline Model

Abstract: To meet the needs of modern applications, computing systems are becoming more powerful, heterogeneous, and complex. NUMA platforms and emerging high-bandwidth memories offer new opportunities for performance improvements. However, they also increase hardware and software complexity, making application performance analysis and optimization an even harder task. The Cache-Aware Roofline Model (CARM) is an insightful yet simple model designed to address this issue. It provides feedback on potential application bottlenecks and shows how far the application's performance is from the achievable hardware upper bounds. However, it does not cover NUMA systems or next-generation processors with heterogeneous memories. Yet some application bottlenecks lie in those memory subsystems and would benefit from the CARM's insights. In this paper, we fill in the missing requirements to bring recent large shared-memory systems within the scope of the CARM. We provide the methodology to instantiate and validate the model on a NUMA system as well as on the latest Xeon Phi processor equipped with configurable hybrid memory. Finally, we show the model's ability to expose several bottlenecks of such systems that the CARM did not previously support.
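As a rough illustration of the "achievable hardware upper bounds" mentioned in the abstract, the sketch below evaluates the standard roofline bound, attainable performance = min(peak compute, bandwidth × arithmetic intensity), once per memory level. This is an assumption about the general form of the bound, not the paper's implementation. The function names, the memory level labels, and all numeric values are hypothetical and are not taken from the paper.

```python
# Minimal roofline-style sketch. All numbers below are hypothetical
# placeholders, not measurements from the paper.

def roofline_bound(intensity, peak_gflops, bandwidth_gbs):
    """Upper bound on performance (GFLOP/s) for a kernel with the given
    arithmetic intensity (FLOP/byte), limited either by peak compute
    or by the bandwidth of one memory level."""
    return min(peak_gflops, bandwidth_gbs * intensity)


def bounds_per_level(intensity, peak_gflops, bandwidths_gbs):
    """Evaluate the bound once per memory level (one 'roof' per level)."""
    return {
        level: roofline_bound(intensity, peak_gflops, bw)
        for level, bw in bandwidths_gbs.items()
    }


if __name__ == "__main__":
    # Hypothetical platform: one compute peak, several memory levels,
    # including a local and a remote NUMA node.
    PEAK_GFLOPS = 1000.0
    BANDWIDTHS_GBS = {
        "L1": 4000.0,
        "local NUMA node": 100.0,
        "remote NUMA node": 50.0,
    }
    for level, bound in bounds_per_level(0.5, PEAK_GFLOPS, BANDWIDTHS_GBS).items():
        print(f"{level}: at most {bound:.1f} GFLOP/s")
```

Comparing a kernel's measured performance against the bound for the memory level it actually reads from indicates how much headroom remains. A separate entry for remote memory shows how a NUMA placement problem would appear as a lower roof.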
Document type:
Conference paper
High Performance Computing systems - Performance Modeling, Benchmarking, and Simulation - 8th International Workshop, PMBS 2017, Nov 2017, Denver (CO), United States. Springer, 10724, pp.91-113, 2017, Lecture Notes in Computer Science. 〈10.1007/978-3-319-72971-8_5〉
Cited literature [20 references]

https://hal.inria.fr/hal-01622582
Contributor: Nicolas Denoyelle
Submitted on: Tuesday, October 24, 2017 - 14:55:52
Last modified on: Wednesday, February 14, 2018 - 14:39:14
Document(s) archived on: Thursday, January 25, 2018 - 13:12:51

Files

Paper.pdf
Files produced by the author(s)

Citation

Nicolas Denoyelle, Brice Goglin, Aleksandar Ilic, Emmanuel Jeannot, Leonel Sousa. Modeling Large Compute Nodes with Heterogeneous Memories with Cache-Aware Roofline Model. High Performance Computing systems - Performance Modeling, Benchmarking, and Simulation - 8th International Workshop, PMBS 2017, Nov 2017, Denver (CO), United States. Springer, 10724, pp.91-113, 2017, Lecture Notes in Computer Science. 〈10.1007/978-3-319-72971-8_5〉. 〈hal-01622582〉

Metrics

Record views: 208

File downloads: 123