F. Richard, . Barrett, S. Paul, . Crozier, . Dw-doerfler et al., Assessing the role of mini-applications in predicting key performance characteristics of scientific and engineering applications, Journal of Parallel and Distributed Computing, vol.75, pp.107-122, 2015.

F. Broquedis, J. Clet-ortega, S. Moreaud, N. Furmento, B. Goglin et al., hwloc: A Generic Framework for Managing Hardware Affinities in HPC Applications, 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing, 2010.
DOI : 10.1109/PDP.2010.67

URL : https://hal.archives-ouvertes.fr/inria-00429889

B. Goglin, Exposing the Locality of Heterogeneous Memory Architectures to HPC Applications, Proceedings of the Second International Symposium on Memory Systems , MEMSYS '16, 2016.
DOI : 10.1145/2989081.2989115

URL : https://hal.archives-ouvertes.fr/hal-01330194

A. Ilic, F. Pratas, and L. Sousa, Cache-aware Roofline model: Upgrading the loft, IEEE Computer Architecture Letters, vol.13, issue.1, pp.21-24, 2014.
DOI : 10.1109/L-CA.2013.6

K. Kim, K. Kim, and Q. Park, Performance analysis and optimization of three-dimensional FDTD on GPU using roofline model, Computer Physics Communications, vol.182, issue.6, pp.1201-1207, 2011.
DOI : 10.1016/j.cpc.2011.01.025

Y. J. Lo, S. Williams, B. Van-straalen, T. J. Ligocki, M. J. Cordery et al., Roofline Model Toolkit: A Practical Tool for Architectural and Program Analysis, pp.129-148, 2015.
DOI : 10.1007/978-3-319-17248-4_7

D. John and . Mccalpin, Stream benchmark. Link: www. cs. virginia. edu/stream/ref, p.22, 1995.

J. Philip, S. Mucci, C. Browne, G. Deane, and . Ho, Papi: A portable interface to hardware performance counters, Proceedings of the department of defense HPCMP users group conference, pp.7-10, 1999.

D. Rossinelli, C. Conti, and P. Koumoutsakos, Mesh-particle interpolations on graphics processing units and multicore central processing units, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, vol.52, issue.1944, pp.3692164-2175, 1944.
DOI : 10.1146/annurev.fluid.37.061903.175753

A. Sodan, Multi Core Trends in High Performance Computing. https://www.sics.se/sites

R. V. Van-nieuwpoort and J. W. Romein, Using many-core hardware to correlate radio astronomy signals, Proceedings of the 23rd international conference on Conference on Supercomputing, ICS '09, pp.440-449, 2009.
DOI : 10.1145/1542275.1542337

S. Williams, A. Waterman, and D. Patterson, Roofline, Communications of the ACM, vol.52, issue.4, pp.65-76, 2009.
DOI : 10.1145/1498765.1498785