On Using the Roofline Model with Lower Bounds on Data Movement

Abstract : The roofline model is a popular approach to ``bounds and bottleneck'' performance analysis. It focuses on the limits to performance of processors because of limited bandwidth to off-chip memory. It models upper bounds on performance as a function of operational intensity, the ratio of computational operations per byte of data moved from/to memory. While operational intensity can be directly measured for a specific implementation of an algorithm on a particular target platform, it is of interest to obtain broader insights on bottlenecks, where various semantically equivalent implementations of an algorithm are considered, along with analysis for variations in architectural parameters. This is currently very cumbersome and requires performance modeling and analysis of many variants. In this paper, we alleviate this problem by using the roofline model in conjunction with upper bounds on the operational intensity of computations as a function of cache capacity, derived using lower bounds on data movement. This enables bottleneck analysis that holds across all dependence-preserving semantically equivalent implementations of an algorithm. We demonstrate the utility of the approach in in assessing fundamental limits to performance and energy efficiency for several benchmark algorithms across a design space of architectural variations.
Type de document :
Article dans une revue
ACM Transactions on Architecture and Code Optimization (TACO) , ACM, 2015, 11 (4), pp.67:1--67:23
Liste complète des métadonnées

https://hal.inria.fr/hal-01104765
Contributeur : Fabrice Rastello <>
Soumis le : lundi 19 janvier 2015 - 11:10:18
Dernière modification le : lundi 30 avril 2018 - 15:02:01

Identifiants

  • HAL Id : hal-01104765, version 1

Citation

Venmugil Elango, Naser Sedaghati, Fabrice Rastello, Louis-Noël Pouchet, Jagannathan Ramanujam, et al.. On Using the Roofline Model with Lower Bounds on Data Movement. ACM Transactions on Architecture and Code Optimization (TACO) , ACM, 2015, 11 (4), pp.67:1--67:23. 〈hal-01104765〉

Partager

Métriques

Consultations de la notice

595