Conference papers

Modeling Large Compute Nodes with Heterogeneous Memories with Cache-Aware Roofline Model

Abstract : To meet the needs of modern applications, computing systems are becoming more powerful, heterogeneous and complex. NUMA platforms and emerging high-bandwidth memories offer new opportunities for performance improvements. However, they also increase hardware and software complexity, making application performance analysis and optimization even harder. The Cache-Aware Roofline Model (CARM) is an insightful yet simple model designed to address this issue. It provides feedback on potential application bottlenecks and shows how far application performance is from the achievable hardware upper bounds. However, it does not cover NUMA systems or next-generation processors with heterogeneous memories. Yet some application bottlenecks lie in those memory subsystems and would benefit from the insights of the CARM. In this paper, we fill in the missing requirements to extend the CARM to recent large shared-memory systems. We provide the methodology to instantiate and validate the model on a NUMA system as well as on the latest Xeon Phi processor equipped with configurable hybrid memory. Finally, we show the model's ability to expose several bottlenecks of such systems that were not previously supported by the CARM.
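As a rough illustration of the roofline idea the abstract refers to, the sketch below computes one performance roof per memory level as min(peak compute, bandwidth x arithmetic intensity) and reports how close a measured kernel is to a chosen roof. It is a minimal sketch, not the paper's methodology: the function names, the memory levels and every platform number are hypothetical placeholders chosen for illustration, not measurements from the paper.

```python
# Minimal roofline sketch. All platform figures below are hypothetical
# placeholders, not values taken from the paper.

PEAK_GFLOPS = 1000.0  # assumed peak floating-point throughput (GFLOP/s)

# Assumed sustained bandwidth (GB/s) for each memory level, as seen by the cores.
BANDWIDTHS_GBS = {
    "L1": 4000.0,
    "MCDRAM": 400.0,
    "local DRAM": 100.0,
    "remote DRAM": 50.0,
}


def attainable_gflops(intensity, peak_gflops, bandwidth_gbs):
    """Roofline bound: the lower of the compute roof and the memory roof.

    intensity is the arithmetic intensity in FLOP per byte.
    """
    return min(peak_gflops, bandwidth_gbs * intensity)


def roofs(intensity):
    """Return the attainable performance under each memory level's roof."""
    return {
        level: attainable_gflops(intensity, PEAK_GFLOPS, bandwidth)
        for level, bandwidth in BANDWIDTHS_GBS.items()
    }


def fraction_of_roof(measured_gflops, intensity, level):
    """Ratio of measured performance to the roof of the given memory level."""
    return measured_gflops / roofs(intensity)[level]


if __name__ == "__main__":
    intensity = 0.5  # FLOP/byte of a hypothetical kernel
    for level, bound in roofs(intensity).items():
        print(f"{level:12s} roof: {bound:7.1f} GFLOP/s")
    print(fraction_of_roof(20.0, intensity, "remote DRAM"))
```

Adding one bandwidth entry per NUMA node or per memory type is what lets a single plot distinguish, for example, a kernel bound by remote DRAM from one bound by local DRAM.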

Contributor : Nicolas Denoyelle
Submitted on : Tuesday, October 24, 2017 - 2:55:52 PM
Last modification on : Friday, January 21, 2022 - 3:10:07 AM
Long-term archiving on : Thursday, January 25, 2018 - 1:12:51 PM






Nicolas Denoyelle, Brice Goglin, Aleksandar Ilic, Emmanuel Jeannot, Leonel Sousa. Modeling Large Compute Nodes with Heterogeneous Memories with Cache-Aware Roofline Model. High Performance Computing systems - Performance Modeling, Benchmarking, and Simulation - 8th International Workshop, PMBS 2017, Nov 2017, Denver (CO), United States. pp.91-113, ⟨10.1007/978-3-319-72971-8_5⟩. ⟨hal-01622582⟩


